Pre-initialize torch._dynamo to prevent double-registration with `peft` torch.compile() call #1228
Conversation
No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration
- Configuration used: Path: `.coderabbit.yaml`
- Review profile: CHILL
- Plan: Pro

🚧 Files skipped from review as they are similar to previous changes (1)

Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks: ✅ 4 passed
Codecov Report

✅ All modified and coverable lines are covered by tests.

Additional details and impacted files:

```
@@            Coverage Diff             @@
##             main    #1228      +/-   ##
==========================================
+ Coverage   72.14%   77.46%   +5.31%
==========================================
  Files         350      350
  Lines       40478    40480       +2
==========================================
+ Hits        29202    31356    +2154
+ Misses      11276     9124    -2152
```

Flags with carried forward coverage won't be shown.
🧹 Nitpick comments (1)

`modelopt/torch/__init__.py` (1)

Lines 24-25: Consider deferring `torch._dynamo` initialization to avoid coupling all users to this private Torch submodule at import time. Line 25 imports `torch._dynamo` unconditionally for all users of `modelopt.torch`, even those not using `torch.compile()`. Since PEFT and compile features are optional integrations, and the codebase already uses lazy initialization elsewhere (e.g., `speculative/plugins/transformers.py:625`), move this import behind a guard or defer it until needed:

♻️ Proposed lazy approach

```diff
 # Pre-initialize torch._dynamo to prevent double-registration with peft's torch.compile() call
-importlib.import_module("torch._dynamo")
+try:
+    importlib.import_module("torch._dynamo")
+except ImportError:
+    _warnings.warn(
+        "torch._dynamo is unavailable; compile-related integrations may initialize it later.",
+        RuntimeWarning,
+    )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@modelopt/torch/__init__.py` around lines 24-25: the unconditional `importlib.import_module("torch._dynamo")` in `modelopt.torch` causes eager coupling to a private Torch submodule; change it to a lazy/guarded initialization so `torch._dynamo` is only imported when needed (e.g., when PEFT or `torch.compile` integrations are invoked). Add a small helper (e.g., `ensure_dynamo_initialized`, or a guarded branch in the code paths that use `torch.compile`) that calls `importlib.import_module("torch._dynamo")` the first time it's required, and remove the top-level import from the module init so normal imports of `modelopt.torch` don't trigger the `torch._dynamo` import.
ℹ️ Review info

⚙️ Run configuration
- Configuration used: Path: `.coderabbit.yaml`
- Review profile: CHILL
- Plan: Pro
- Run ID: d9336e4b-80aa-44c3-8dda-90ed3ad9371c

📒 Files selected for processing (1)
- `modelopt/torch/__init__.py`
/ok to test 8e1696e

/ok to test 89b860d
Pre-initialize torch._dynamo to prevent double-registration with `peft` torch.compile() call (#1228)

### What does this PR do?

**Type of change:** Bug fix

This PR fixes the error `AssertionError: Artifact of type=precompile already registered in mega-cache artifact factory` when launching with local docker. The error message:

```
Traceback (most recent call last):
  File "/nemo_run/code/modules/Megatron-LM/examples/post_training/modelopt/quantize.py", line 21, in <module>
    import modelopt.torch.quantization as mtq
  File "/usr/local/lib/python3.12/dist-packages/modelopt/torch/__init__.py", line 23, in <module>
    from . import distill, nas, opt, peft, prune, quantization, sparsity, speculative, utils
  File "/usr/local/lib/python3.12/dist-packages/modelopt/torch/prune/__init__.py", line 24, in <module>
    from . import fastnas, gradnas, plugins
  File "/usr/local/lib/python3.12/dist-packages/modelopt/torch/prune/gradnas.py", line 71, in <module>
    from transformers.models.bert.modeling_bert import BertAttention
  File "/usr/local/lib/python3.12/dist-packages/transformers/models/bert/modeling_bert.py", line 30, in <module>
    from ...generation import GenerationMixin
  File "<frozen importlib._bootstrap>", line 1412, in _handle_fromlist
  File "/usr/local/lib/python3.12/dist-packages/transformers/utils/import_utils.py", line 2317, in __getattr__
    module = self._get_module(self._class_to_module[name])
  File "/usr/local/lib/python3.12/dist-packages/transformers/utils/import_utils.py", line 2347, in _get_module
    raise e
  File "/usr/local/lib/python3.12/dist-packages/transformers/utils/import_utils.py", line 2345, in _get_module
    return importlib.import_module("." + module_name, self.__name__)
  File "/usr/lib/python3.12/importlib/__init__.py", line 90, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "/usr/local/lib/python3.12/dist-packages/transformers/generation/utils.py", line 43, in <module>
    from ..masking_utils import create_masks_for_generate
  File "/usr/local/lib/python3.12/dist-packages/transformers/masking_utils.py", line 40, in <module>
    from torch._dynamo._trace_wrapped_higher_order_op import TransformGetItemToIndex
  File "/usr/local/lib/python3.12/dist-packages/torch/_dynamo/__init__.py", line 13, in <module>
    from . import (
  File "/usr/local/lib/python3.12/dist-packages/torch/_dynamo/aot_compile.py", line 16, in <module>
    from torch._dynamo.package import SystemInfo
  File "/usr/local/lib/python3.12/dist-packages/torch/_dynamo/package.py", line 443, in <module>
    @CacheArtifactFactory.register
  File "/usr/local/lib/python3.12/dist-packages/torch/compiler/_cache.py", line 72, in register
    assert artifact_cls.type() not in cls._artifact_types, (
AssertionError: Artifact of type=precompile already registered in mega-cache artifact factory
```

### Before your PR is "*Ready for review*"

- Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/Model-Optimizer/blob/main/CONTRIBUTING.md) and your commits are signed (`git commit -s -S`).
- Make sure you read and follow the [Security Best Practices](https://github.com/NVIDIA/Model-Optimizer/blob/main/SECURITY.md#security-coding-practices-for-contributors) (e.g. avoiding hardcoded `trust_remote_code=True`, `torch.load(..., weights_only=False)`, `pickle`, etc.).
- Is this change backward compatible?: ✅
- If you copied code from any other sources or added a new PIP dependency, did you follow guidance in `CONTRIBUTING.md`: ❌
- Did you write any new necessary tests?: ❌
- Did you update [Changelog](https://github.com/NVIDIA/Model-Optimizer/blob/main/CHANGELOG.rst)?: ❌

### Summary by CodeRabbit

* **Bug Fixes**
  * Improved startup compatibility with certain Torch-based optimization tools by ensuring required runtime components are loaded during package initialization to prevent initialization errors.
  * No changes to public APIs or exported modules; behavior and interfaces remain unchanged.

Signed-off-by: Hung-Yueh Chiang <hungyuehc@nvidia.com>
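The assertion in the traceback comes from a registration decorator that refuses duplicate artifact types. A toy registry (illustrative only; the class and method names below are stand-ins, not the real `torch.compiler._cache` implementation) reproduces the failure mode: if the module body defining the decorated class runs twice, the second pass trips the assert. Pre-importing `torch._dynamo` once at package init means any later import hits Python's `sys.modules` cache instead of partially re-executing module-level registration:

```python
class CacheArtifactRegistry:
    """Toy stand-in for torch.compiler._cache.CacheArtifactFactory."""

    _artifact_types: dict = {}

    @classmethod
    def register(cls, artifact_cls):
        name = artifact_cls.__name__
        # Mirrors the assert in the traceback: registering the same
        # artifact type twice is treated as a programming error.
        assert name not in cls._artifact_types, (
            f"Artifact of type={name} already registered"
        )
        cls._artifact_types[name] = artifact_cls
        return artifact_cls


@CacheArtifactRegistry.register
class PrecompileArtifact:
    pass


# Running the decorator a second time (as re-executing the defining
# module's body would) trips the assertion:
try:
    CacheArtifactRegistry.register(PrecompileArtifact)
    double_registration_failed = False
except AssertionError:
    double_registration_failed = True
```

This is why the two-line fix of importing `torch._dynamo` eagerly works: a module body executes at most once per interpreter under normal imports, so the registration decorators in `torch/_dynamo/package.py` only ever run once.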