[model] fix: register both MTP submodule spellings in qwen3_next_bridge by Zhichenzzz · Pull Request #11 · radixark/Megatron-Bridge

Zhichenzzz · 2026-06-10T04:08:22Z

Megatron-LM renamed the MTP submodule transformer_layer → mtp_model_layer (upstream NVIDIA main has no old-name API surface left). qwen3_next_bridge still registered only the old spelling, so MTP weights fail to map against a renamed Megatron-LM — while qwen35_vl_bridge and deepseek/common.py already expect the new name, and glm45_bridge handles both.

Register both spellings (glm45's approach) for all 12 MTP mappings (6 AutoMapping dict entries + QKV/GatedMLP/Replicated special mappings) so the bridge works with either Megatron-LM version. Mapping entries whose megatron param does not exist at runtime are never consulted, so the extra spelling is inert.

Validated: mapping_registry().megatron_to_hf_lookup resolves all 8 probe names (both spellings × router/QKV/expert-fc1/shared-gate) to the correct mapping types.

Companion PRs: radixark/Megatron-LM#54 (the rename itself), radixark/miles#1307 (miles converters).

…bridge Megatron-LM renamed the MTP submodule transformer_layer -> mtp_model_layer. Register every qwen3_next MTP weight mapping (MoE router, layernorms, attention, experts, shared expert) under both spellings so the bridge converts checkpoints from either Megatron-LM version, mirroring the miles converter change.

This was referenced Jun 10, 2026

fix(mtp): rename MTP submodule transformer_layer -> mtp_model_layer radixark/Megatron-LM#54

Open

fix(mtp): track megatron mtp_model_layer rename in raw converters radixark/miles#1307

Open

Zhichenzzz force-pushed the fix/qwen3next-mtp-rename branch from 4728c32 to a06b06d Compare June 18, 2026 21:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[model] fix: register both MTP submodule spellings in qwen3_next_bridge#11

[model] fix: register both MTP submodule spellings in qwen3_next_bridge#11
Zhichenzzz wants to merge 1 commit into
bridgefrom
fix/qwen3next-mtp-rename

Zhichenzzz commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Zhichenzzz commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant