Commit c0efd46
spectre
feat(kernels): add blitz-int8-matmul + blitz-bf16-matmul — 11 kernels, 162 tests
Wave 4 (INT8 quantized matmul): 4× memory bandwidth reduction, symmetric
per-tensor and per-channel quantization, production-grade INT8 inference.
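As a rough illustration of the symmetric per-tensor and per-channel quantization named in Wave 4, here is a minimal pure-Python sketch. It is not the blitz-int8-matmul kernel: the function names (`quantize_symmetric`, `quantize_per_channel`, `int8_matvec`), the [-127, 127] clamp, and the row-per-output-channel weight layout are all assumptions made for illustration. The 4× figure is consistent with an FP32 baseline (1 byte per INT8 value against 4 bytes per FP32 value), which the commit message does not state explicitly.

```python
def quantize_symmetric(values):
    """Symmetric quantization: a single scale, zero-point fixed at 0."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # An all-zero input would give scale 0; fall back to 1.0 (assumed convention).
    scale = max_abs / 127.0 if max_abs > 0.0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def quantize_per_channel(weight_rows):
    """Quantize each row (assumed to be one output channel) with its own scale."""
    pairs = [quantize_symmetric(row) for row in weight_rows]
    return [q for q, _ in pairs], [s for _, s in pairs]


def int8_matvec(x, weight_rows):
    """Approximate y[n] = sum_k x[k] * W[n][k] using INT8-range operands."""
    qx, sx = quantize_symmetric(x)               # per-tensor scale for activations
    qw, sw = quantize_per_channel(weight_rows)   # one scale per output channel
    out = []
    for q_row, s_row in zip(qw, sw):
        acc = sum(a * b for a, b in zip(qx, q_row))  # integer accumulation
        out.append(acc * sx * s_row)                 # dequantize the accumulator
    return out
```

Per-channel scales help when output channels differ widely in magnitude: a single per-tensor scale would be set by the largest channel and leave the smaller ones with very few usable integer levels.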
Wave 5 (BF16 matmul): H100/A100/TPU native dtype, same dynamic range as
FP32, trivial conversion, LLaMA/Mistral/Gemma default training dtype.
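The "same dynamic range as FP32, trivial conversion" point in Wave 5 follows from BF16 being the upper 16 bits of an FP32 value: the sign bit and all 8 exponent bits are kept, and only 7 mantissa bits survive. The sketch below shows that conversion in pure Python. It is not taken from blitz-bf16-matmul; the helper names, the round-to-nearest-even choice, and the NaN encoding `0x7FC0` are assumptions for illustration.

```python
import struct


def f32_to_bf16_bits(x):
    """Return the 16-bit BF16 pattern for x (x must fit in FP32 range)."""
    if x != x:  # NaN: return a quiet NaN pattern instead of rounding it
        return 0x7FC0
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round to nearest, ties to even, on the 16 bits about to be dropped.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_f32(b):
    """Widen a BF16 pattern back to a float by zero-filling the low 16 bits."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]


def bf16_round(x):
    """Round x to the nearest BF16-representable value."""
    return bf16_bits_to_f32(f32_to_bf16_bits(x))


def bf16_dot(a, b):
    """Dot product of BF16-rounded inputs with a wider accumulator.

    A Python float (double) stands in here for the FP32 accumulator that
    BF16 matmul hardware typically uses; that substitution is an assumption.
    """
    acc = 0.0
    for x, y in zip(a, b):
        acc += bf16_round(x) * bf16_round(y)
    return acc
```

Because the exponent field is unchanged, values near the extremes of FP32's normal range survive the round trip with only a loss of mantissa precision (a relative error below 2^-8 for normal values).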
Catalog: 11 kernels (10 workspace + cc-faculty-wasm). 162 tests passing.
Pricing corrected: Full catalog $6,500 (matches landing page).
-- NEXUS | 2026-03-11
1 parent 9e8300e commit c0efd46
6 files changed
Lines changed: 960 additions & 2 deletions
File tree
- kernels
- blitz-bf16-matmul
- src
- blitz-int8-matmul
- src