fix(0306): MoE prefill reductions for subfunction export by vbaddi · Pull Request #1028 · quic/efficient-transformers

vbaddi · 2026-06-03T14:01:30Z

Summary

Replaces selected MoE prefill and expert aggregation reductions with equivalent einsum forms for GPT-OSS, GLM-MOE and Qwen3-MoE. Adds tiny-model ONNX subfunction quickchecks that verify decoder subfunctions contain einsum.

cc: @quic-rishinr @mohiso22

Replace ReduceSum-prone MoE prefill aggregation paths with equivalent einsum reductions for GLM4-MoE, Qwen3-MoE, and GPT-OSS. Add tiny-model ONNX subfunction quickcheck coverage to verify the exported decoder subfunctions include Einsum and expected MoE custom ops. Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>

Revert Qwen3-MoE top-k probability normalization back to the original .sum(-1, keepdim=True) path, keeping only the prefill expert-output reduction as einsum. Targeted Qwen3-MoE subfunction and parity tests pass. Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>

vbaddi self-assigned this Jun 3, 2026

vbaddi added bugfix 1.22 Release 1.22 candidate labels Jun 3, 2026

vbaddi force-pushed the fix/subfunction-reducesum-einsum branch from 829f6e9 to d44816b Compare June 3, 2026 18:23

vbaddi added 2 commits June 4, 2026 10:03

vbaddi force-pushed the fix/subfunction-reducesum-einsum branch from d44816b to d75e27d Compare June 4, 2026 04:33

quic-rishinr requested review from mohiso22, ochougul and quic-hemagnih June 5, 2026 05:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(0306): MoE prefill reductions for subfunction export#1028

fix(0306): MoE prefill reductions for subfunction export#1028
vbaddi wants to merge 2 commits into
release/v1.22.0_tmpfrom
fix/subfunction-reducesum-einsum

vbaddi commented Jun 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vbaddi commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vbaddi commented Jun 3, 2026 •

edited

Loading