Add Support for MiniMax 2.5 Model #1014

Draft
vbaddi wants to merge 8 commits into quic:release/v1.22.0_tmp from vbaddi:dev/enable_minimax_25

vbaddi (Contributor) commented May 30, 2026

No description provided.

vbaddi added 8 commits March 14, 2026 12:00
…T parity

- Rebased downstream wrapper stack to transformers==5.3.0 and aligned coupled deps
    (huggingface-hub, peft, diffusers) in project config.
- Updated model wrapper compatibility paths across causal/VLM/audio/export flows
    to match upstream v5 APIs while preserving downstream public behavior.
- Hardened cache compatibility layer and runtime glue for mixed legacy/new cache
    semantics used by downstream generation/export paths.
- Fixed attention/mask/rotary call-path mismatches introduced by upstream API
    changes (including model-specific signature updates).
- Updated AWQ/quantizer and export compatibility paths to remain ONNX-safe.
- Resolved MoE/export edge cases (including Mixtral/gpt_oss) to keep
    HF PyTorch -> downstream PyTorch -> ONNXRuntime token parity.
- Validation evidence:
    pyenv activate qeff.mainline
    python -m pytest -q tests/test_model_quickcheck.py -n 16
    Result: 26 passed.
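
The token-parity requirement named above (HF PyTorch -> downstream PyTorch -> ONNXRuntime) can be illustrated with a minimal sketch. This is not the code from `tests/test_model_quickcheck.py`, which is not shown on this page; the helper names `first_mismatch` and `check_token_parity`, the stage labels, and the token IDs are all hypothetical. The sketch assumes each stage has already produced a list of greedily decoded token IDs for the same prompt.

```python
from typing import Dict, List, Optional, Sequence


def first_mismatch(reference: Sequence[int], candidate: Sequence[int]) -> Optional[int]:
    """Return the index of the first differing token, or None if the sequences match."""
    for i, (ref_tok, cand_tok) in enumerate(zip(reference, candidate)):
        if ref_tok != cand_tok:
            return i
    # One sequence is a strict prefix of the other: report where it ends.
    if len(reference) != len(candidate):
        return min(len(reference), len(candidate))
    return None


def check_token_parity(
    outputs: Dict[str, List[int]], reference_stage: str
) -> Dict[str, Optional[int]]:
    """Compare every stage against the reference stage.

    Returns a mapping of stage name -> first mismatch index (None means parity).
    """
    reference = outputs[reference_stage]
    return {
        stage: first_mismatch(reference, tokens)
        for stage, tokens in outputs.items()
        if stage != reference_stage
    }


if __name__ == "__main__":
    # Hypothetical token IDs; a real check would collect these from each runtime.
    outputs = {
        "hf_pytorch": [101, 7, 42, 9],
        "downstream_pytorch": [101, 7, 42, 9],
        "onnxruntime": [101, 7, 42, 9],
    }
    report = check_token_parity(outputs, reference_stage="hf_pytorch")
    assert all(idx is None for idx in report.values()), report
```

Reporting the first mismatch index, rather than only pass/fail, makes it easier to tell whether a divergence appears at the first generated token (often a mask or cache setup problem) or later in decoding.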

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
…odeling_qeff

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
…and do fresh runs

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
- add MiniMaxM2 MoE wrapper and KV/cache transform mappings
- add tiny-random/minimax-m2.5 causal runtime parity coverage
- handle MiniMax trust_remote_code compatibility in quickcheck
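
One common way to wire a model-specific wrapper into an existing model is a class-mapping transform: walk the module tree and swap each matching module's class for an export-friendly subclass. The sketch below shows the general idea with plain Python objects only. It is an assumption about the pattern, not the code in this PR, and every name in it (`Module`, `UpstreamMoE`, `ExportFriendlyMoE`, `apply_class_mapping`) is hypothetical.

```python
from typing import Dict, Iterator, List


class Module:
    """Tiny stand-in for a framework module that can hold child modules."""

    def __init__(self, *children: "Module") -> None:
        self.children: List["Module"] = list(children)

    def walk(self) -> Iterator["Module"]:
        """Yield this module and all descendants, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


class UpstreamMoE(Module):
    """Hypothetical stand-in for an upstream MoE block."""

    def forward(self) -> str:
        return "upstream routing"


class ExportFriendlyMoE(UpstreamMoE):
    """Hypothetical wrapper: same state, different forward implementation."""

    def forward(self) -> str:
        return "export-friendly routing"


def apply_class_mapping(root: Module, mapping: Dict[type, type]) -> int:
    """Swap the class of every module whose exact type is a key in `mapping`.

    Instance state is left untouched; only the class (and so the methods) changes.
    Returns the number of modules that were swapped.
    """
    swapped = 0
    for module in root.walk():
        replacement = mapping.get(type(module))
        if replacement is not None:
            module.__class__ = replacement
            swapped += 1
    return swapped


if __name__ == "__main__":
    model = Module(UpstreamMoE(), Module(UpstreamMoE()))
    count = apply_class_mapping(model, {UpstreamMoE: ExportFriendlyMoE})
    assert count == 2
    assert model.children[0].forward() == "export-friendly routing"
```

Because the replacement class subclasses the original and adds no new instance state, existing weights and attributes carry over unchanged; only method resolution differs after the swap.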

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
vbaddi assigned vbaddi and tchawada and unassigned tchawada on May 30, 2026

2 participants