Commit e12fcbd
fix(qwen35): early SSM probe fixes DeltaNet layer detection — Qwen3.5-4B now coherent
The fused QKV guard (delta_a_log check) ran BEFORE delta_a_log was
set in the per-layer loop, so DeltaNet layers still got gguf_w_qkv
assigned. Fixed by probing blk.N.ssm_a BEFORE the fused QKV check.
Result: n_attn_layers correctly reports 8 (not 32) for Qwen3.5-4B.
DeltaNet layers dispatch to deltanet_forward, full attention layers
to self_attn_forward with partial RoPE + NeoX rotation.
Validated:
Qwen3.5-4B: 'The capital of France is **Paris**.'
Qwen3-4B: coherent (4.5 tok/s)
Phi-3.5: coherent (1.9 tok/s)
35/35 unit tests pass
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent b8a27d2 commit e12fcbd
1 file changed
+12
-12
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11851 | 11851 | | |
11852 | 11852 | | |
11853 | 11853 | | |
11854 | | - | |
11855 | | - | |
11856 | | - | |
11857 | | - | |
11858 | | - | |
11859 | | - | |
11860 | | - | |
11861 | | - | |
| 11854 | + | |
| 11855 | + | |
| 11856 | + | |
| 11857 | + | |
| 11858 | + | |
| 11859 | + | |
| 11860 | + | |
| 11861 | + | |
| 11862 | + | |
| 11863 | + | |
| 11864 | + | |
11862 | 11865 | | |
11863 | 11866 | | |
11864 | | - | |
11865 | | - | |
11866 | | - | |
11867 | | - | |
| 11867 | + | |
11868 | 11868 | | |
11869 | 11869 | | |
11870 | 11870 | | |
| |||
0 commit comments