Prompt file: prompts/bringup_eval_long.txt Target prompt length: 100-200 tokens (per-model tokenization varies) Generated tokens: 100
Note: Keep the table columns padded with spaces and right-justify numeric cells so it stays aligned in terminal views. Top-1 and Top-5 are whole percents (0 d.p.). Rows are sorted functional-first, then optimized.
| Model | Hardware | Variant | Top-1 | Top-5 | TTFT | t/s/u | Seq len |
|---|---|---|---|---|---|---|---|
| arcee-ai/Arcee-Spark | n150 | functional | 92% | 100% | 99ms | 13.9 | 29952 |
| arcee-ai/Arcee-Spark | n300 | functional | 91% | 100% | 338ms | 5.0 | 32768 |
| arcee-ai/Arcee-Spark | t3000 | functional | 90% | 100% | 343ms | 4.9 | 32768 |
| arcee-ai/AFM-4.5B | n150 | functional | 98% | 100% | 72ms | 17.2 | 65536 |
| arcee-ai/AFM-4.5B | n300 | functional | 97% | 100% | 283ms | 5.6 | 65536 |
| arcee-ai/AFM-4.5B | t3000 | functional | 98% | 100% | 181ms | 7.1 | 65536 |
| humain-ai/ALLaM-7B-Instruct-preview | n150 | functional | 97% | 100% | 76ms | 14.9 | 4096 |
| humain-ai/ALLaM-7B-Instruct-preview | n300 | functional | 97% | 100% | 184ms | 7.9 | 4096 |
| humain-ai/ALLaM-7B-Instruct-preview | t3000 | functional | 95% | 100% | 127ms | 9.1 | 4096 |
| meta-llama/Llama-3.2-1B | n150 | functional | 92% | 100% | 34ms | 39.5 | 131072 |
| meta-llama/Llama-3.2-1B | n300 | functional | 90% | 100% | 610ms | 6.7 | 131072 |
| meta-llama/Llama-3.2-1B | t3000 | functional | 92% | 100% | 267ms | 6.6 | 131072 |
| mistralai/Mistral-7B-Instruct-v0.3 | n150 | functional | 93% | 100% | 105ms | 16.5 | 32768 |
| mistralai/Mistral-7B-Instruct-v0.3 | n300 | functional | 96% | 100% | 112ms | 11.1 | 32768 |
| mistralai/Mistral-7B-Instruct-v0.3 | t3000 | functional | 95% | 100% | 104ms | 9.9 | 32768 |
| Qwen/Qwen3-0.6B | n150 | functional | 99% | 100% | 52ms | 28.0 | 40960 |
| Qwen/Qwen3-0.6B | n300 | functional | 99% | 100% | 943ms | 2.0 | 40960 |
| Qwen/Qwen3-0.6B | t3000 | functional | 98% | 100% | 229ms | 6.2 | 40960 |
| Qwen/Qwen3-30B-A3B | n150 | functional | 94% | 100% | 100081ms | 0.4 | 40960 |
| google/gemma-3-4b-it | n150 | functional | 92% | 100% | 98ms | 13.9 | 40960 |
| google/gemma-3-4b-it | n300 | functional | 94% | 100% | 535ms | 3.2 | 40960 |
| google/gemma-3-4b-it | t3000 | functional | 92% | 100% | 330ms | 4.7 | 40960 |
| microsoft/Phi-3-mini-128k-instruct | n150 | functional | 92% | 99% | 80ms | 13.7 | 12288 |
| microsoft/Phi-3-mini-128k-instruct | n300 | functional | 90% | 100% | 193ms | 6.7 | 12288 |
| microsoft/Phi-3-mini-128k-instruct | t3000 | functional | 90% | 100% | 184ms | 6.8 | 12288 |
| tiiuae/Falcon3-7B-Instruct | n150 | functional | 97% | 100% | 144ms | 13.4 | 32768 |
| tiiuae/Falcon3-7B-Instruct | n300 | functional | 97% | 100% | 661ms | 5.6 | 32768 |
| tiiuae/Falcon3-7B-Instruct | t3000 | functional | 97% | 100% | 199ms | 7.3 | 32768 |
| arcee-ai/Arcee-Spark | n150 | optimized | 91% | 100% | 77ms | 14.5 | 29952 |
| arcee-ai/Arcee-Spark | n300 | optimized | 85% | 100% | 101ms | 16.0 | 32768 |
| arcee-ai/Arcee-Spark | t3000 | optimized | 90% | 100% | 72ms | 17.6 | 32768 |
| arcee-ai/AFM-4.5B | n150 | optimized | 98% | 100% | 57ms | 19.6 | 65536 |
| arcee-ai/AFM-4.5B | n300 | optimized | 99% | 100% | 56ms | 23.6 | 65536 |
| arcee-ai/AFM-4.5B | t3000 | optimized | 98% | 100% | 69ms | 29.0 | 65536 |
| humain-ai/ALLaM-7B-Instruct-preview | n150 | optimized | 97% | 100% | 69ms | 15.8 | 4096 |
| humain-ai/ALLaM-7B-Instruct-preview | n300 | optimized | 97% | 100% | 69ms | 15.9 | 4096 |
| humain-ai/ALLaM-7B-Instruct-preview | t3000 | optimized | 97% | 100% | 61ms | 24.3 | 4096 |
| meta-llama/Llama-3.2-1B | n150 | optimized | 92% | 100% | 22ms | 64.8 | 131072 |
| meta-llama/Llama-3.2-1B | n300 | optimized | 91% | 100% | 31ms | 50.0 | 131072 |
| meta-llama/Llama-3.2-1B | t3000 | optimized | 94% | 100% | 36ms | 57.5 | 131072 |
| mistralai/Mistral-7B-Instruct-v0.3 | n150 | optimized | 96% | 100% | 90ms | 17.9 | 32768 |
| mistralai/Mistral-7B-Instruct-v0.3 | n300 | optimized | 97% | 100% | 44ms | 24.8 | 32768 |
| mistralai/Mistral-7B-Instruct-v0.3 | t3000 | optimized | 98% | 100% | 60ms | 26.6 | 32768 |
| Qwen/Qwen3-0.6B | n150 | optimized | 97% | 100% | 25ms | 46.2 | 40960 |
| Qwen/Qwen3-0.6B | n300 | optimized | 99% | 100% | 54ms | 55.3 | 40960 |
| Qwen/Qwen3-0.6B | t3000 | optimized | 98% | 100% | 59ms | 61.9 | 40960 |
| Qwen/Qwen3-30B-A3B | n150 | optimized | 96% | 100% | 2197ms | 4.8 | 40960 |
| google/gemma-3-4b-it | n150 | optimized | 92% | 100% | 70ms | 14.5 | 40960 |
| google/gemma-3-4b-it | n300 | optimized | 94% | 100% | 68ms | 18.5 | 40960 |
| google/gemma-3-4b-it | t3000 | optimized | 91% | 100% | 78ms | 19.4 | 40960 |
| microsoft/Phi-3-mini-128k-instruct | n150 | optimized | 94% | 99% | 69ms | 15.9 | 12288 |
| microsoft/Phi-3-mini-128k-instruct | n300 | optimized | 91% | 100% | 94ms | 18.3 | 12288 |
| microsoft/Phi-3-mini-128k-instruct | t3000 | optimized | 92% | 99% | 105ms | 23.6 | 12288 |
| tiiuae/Falcon3-7B-Instruct | n150 | optimized | 97% | 100% | 102ms | 15.2 | 32768 |
| tiiuae/Falcon3-7B-Instruct | n300 | optimized | 97% | 100% | 72ms | 21.8 | 32768 |
| tiiuae/Falcon3-7B-Instruct | t3000 | optimized | 97% | 100% | 58ms | 26.3 | 32768 |