Summary
Track the validation and performance of qwen3_32b_decode_mixed.py on both A3 and A5 platforms.
Motivation / Use Case
- The mixed-style decode kernel runs correctly on A3
- The mixed-style decode kernel runs correctly on A5
- Performance on both platforms is benchmarked and maintained at a competitive level
- Regressions are caught early as the compiler and runtime evolve
Proposed API / Behavior
Summary
Track the validation and performance of
qwen3_32b_decode_mixed.pyon both A3 and A5 platforms.Motivation / Use Case
Proposed API / Behavior