Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][chore] Waive failed tests
#13035 opened Apr 14, 2026 by yiqingy0 Collaborator Loading…
1 task done
[None][feat] Optimize nemotron-h from python level
#13032 opened Apr 14, 2026 by Wanli-Jiang Collaborator Loading…
1 task done
[None][feat] add time sync for cache transceiver
#13031 opened Apr 14, 2026 by chuangz0 Collaborator Draft
1 task
[None][chore] Add aggregated benchmark in slurm.
#13030 opened Apr 14, 2026 by dominicshanshan Collaborator Loading…
1 task done
[None][refactor] Batch addSequence with two-phase claim and unified V…
#13029 opened Apr 14, 2026 by liji-nv Collaborator Loading…
1 task
[None][test] Add sync_qa_tests Jenkins script and sync-qa-tests skill
#13028 opened Apr 14, 2026 by xinhe-nv Collaborator Loading…
1 task done
[None][fix] handle broken symlinks in build_wheel.py install_file
#13027 opened Apr 14, 2026 by zhenhuaw-me Member Loading…
1 task done
[None][fix] fix Wan unit tests VisualGen
#13026 opened Apr 14, 2026 by zhenhuaw-me Member Loading…
1 task done
[None][fix] Fix chunked prefill API contract for nemotron nano VL
#13025 opened Apr 14, 2026 by 2ez4bz Collaborator Loading…
1 task done
User/imant/arbitrary kv cache transfer
#13018 opened Apr 14, 2026 by Tabrizian Member Draft
1 task
[feat][None] Fused moe all-reduce routed scaling factor + quant support Community want to contribute PRs initiated from Community
#13015 opened Apr 13, 2026 by murphymatt Loading…
1 task done
[None][perf] AutoDeploy: reduce C++ dispatch overhead in decode scheduling loop
#13012 opened Apr 13, 2026 by nvchenghaoz Collaborator Loading…
2 tasks done
[None][perf] triton paged attention: non-pow2 head_dim, decode speedup, logit cap
#13010 opened Apr 13, 2026 by nvchenghaoz Collaborator Loading…
2 tasks done
[https://nvbugs/6070878][fix] Skip gemma3 fp8 test only on L40S
#13009 opened Apr 13, 2026 by brb-nv Collaborator Loading…
1 task done
[None][fix] Enable LoRA in PRAD speculative decoding
#13007 opened Apr 13, 2026 by Funatiq Collaborator Draft
1 task
[TRTLLM-11540][feat] Revert EAGLE3 dynamic tree speculative decoding support (#12062)
#13006 opened Apr 13, 2026 by brb-nv Collaborator Loading…
1 task done
[None][fix] Enable LoRA in EAGLE3 speculative decoding
#13005 opened Apr 13, 2026 by Funatiq Collaborator Draft
1 task
[None][infra] Support nv sa benchmark in CI Perf Test
#13004 opened Apr 13, 2026 by chenfeiz0326 Collaborator Loading…
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.