Pull requests: vllm-project/vllm

Pull requests list

[Model] Support Step1 Model
Labels: new-model, v1
#32511 opened Jan 17, 2026 by randzero
5 tasks
Refactor KV cache updates across attention backends
Labels: nvidia, rocm, v1
#32509 opened Jan 17, 2026 by VedantMadane
Add MTP for opanpangu_pro_moe model, fix an initialization bug in StaticSinkAttention
Labels: bug, v1
#32508 opened Jan 17, 2026 by yt0428
5 tasks
[Fix] test test_function_calling_with_streaming_types about mcp
#32507 opened Jan 17, 2026 by lengrongfu
5 tasks
[Bugfix] Fix llama4_pythonic tool parser for nested list parameters
Labels: bug, llama, tool-calling
#32505 opened Jan 17, 2026 by karanb192
3 of 4 tasks
[Bugfix] Fix Kimi-K2 tool parser streaming regex for multiple tool calls
Labels: bug
#32504 opened Jan 17, 2026 by karanb192
[Misc] Assign worker process titles and logging prefix earlier
Labels: v1
#32503 opened Jan 17, 2026 by karanb192
1 of 2 tasks
Adding LoRA support for qwen omni model
Labels: qwen, v1
#32500 opened Jan 17, 2026 by 0xD4rky
5 tasks
[Bugfix][Hardware][AMD] Fix RCCL initialization in Ray distributed executor
Labels: bug, rocm, v1
#32497 opened Jan 17, 2026 by c0de128
2 tasks
[Bugfix] Fix Llama 4 FP8 failure with FlashInfer on B200 (Nullptr crash)
Labels: bug, llama, nvidia
#32496 opened Jan 17, 2026 by lfopensource
5 tasks
Add embedding input functionality for disabled modalities [remake]
Labels: documentation, frontend, multi-modality, needs-rebase, v1
#32493 opened Jan 16, 2026 by reaganjlee Draft
5 tasks
[RFC][ROCM] Enable aiter attn backend for qwen3-next model
Labels: qwen, rocm, v1
#32492 opened Jan 16, 2026 by jennyyyyzhen
5 tasks
[WIP] Update FlashMLA
Labels: ci/build, ready, ready-run-all-tests
#32491 opened Jan 16, 2026 by LucasWilkinson
"refactor: refactor_repeated_interfaces"
Labels: ready
#32486 opened Jan 16, 2026 by tom-zju
5 tasks