Skip to content

Pull requests: lightseekorg/tokenspeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

test: glm-5.2 agentic bench
#532 opened Jun 26, 2026 by syuoni Member Draft
fix(dflash): fix serverl hang issue for dflash and add ci eval
#531 opened Jun 26, 2026 by yweng0828 Contributor Loading…
[WIP] Initial glm 5.2 support on amd
#528 opened Jun 26, 2026 by borontion Contributor Draft
fix(quant): honor layer-wise quantization exclusion for MoE experts
#527 opened Jun 26, 2026 by jaywme Collaborator Loading…
1 task
fix(spec): drive EAGLE3 aux capture layers from the draft config
#526 opened Jun 26, 2026 by jaywme Collaborator Loading…
[WIP] feat:support qwen3.5 dflash
#510 opened Jun 24, 2026 by minedec Contributor Draft
test(agentic): add EvalScope trie benchmark protocol
#466 opened Jun 17, 2026 by Xiangyi1996 Collaborator Draft
test(ci): add DeepSeek-V4-Flash MTP AIME25 eval
#461 opened Jun 16, 2026 by dongjiyingdjy Contributor Loading…
test: add dp4ep4 case in CI
#453 opened Jun 15, 2026 by tuanzhangCS Contributor Draft
[WIP] Refactor Cache Management
#447 opened Jun 15, 2026 by wangbo981016 Contributor Draft
Fix EP8 DP/TP RSAG init and empty LM head
#416 opened Jun 11, 2026 by yubofredwang Contributor Loading…
feat (L3 KVStore): prefetch and backup support
#293 opened May 28, 2026 by ehuohz Loading…
Add Triton sampling backends alongside FlashInfer
#280 opened May 27, 2026 by FlamingoPg Contributor Loading…
feat(trtllm-MHA): support mixed prefill/decode batches
#176 opened May 18, 2026 by rjzhb Collaborator Draft
4 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.