Pull requests: vllm-project/vllm-omni
Pull requests list
[Feat] support cache-dit for DreamID-Omni
#3265
opened Apr 29, 2026 by
fywc
Contributor
5 tasks
[BugFix][Bagel]: Fix vLLM-Omni as rollout bug: number of trajectory_latents count less
#3258
opened Apr 29, 2026 by
princepride
Collaborator
[Bugfix][Qwen3TTS] Use float32 for code predictor on fp16-only GPUs
#3253
opened Apr 29, 2026 by
NickCao
Contributor
3 of 5 tasks
feat: add embedding-based inference mode for CosyVoice3
#3249
opened Apr 29, 2026 by
daje0601
Support both "voice" and "speaker" params in chat completions
#3248
opened Apr 29, 2026 by
QiuMike
Contributor
5 tasks
[Bugfix] Only emit finish_reason="stop" on last modality in streaming
#3247
opened Apr 29, 2026 by
QiuMike
Contributor
5 tasks
[CI][Bugfix] Improve e2e latency logging and update response classes to include detailed latency documentation
#3246
opened Apr 29, 2026 by
yenuo26
Collaborator
5 tasks
glm-image: cache default HF processor to reduce input preprocessing latency
#3245
opened Apr 29, 2026 by
lyj-jjj
Contributor
[Feature] support SP for FLUX.2-dev
#3244
opened Apr 29, 2026 by
nuclearwu
Contributor
5 tasks
[WIP][Bugfix][HunyuanImage3] Fix offline AR garbage output by switching to Instruct chat template
#3243
opened Apr 29, 2026 by
TaffyOfficial
Contributor
[Bugfix] Prevent silent failure of get_config when trust_remote_code passed as None
#3241
opened Apr 29, 2026 by
yuanheng-zhao
Contributor
5 tasks
[Hardware] Extend diffusion engine plugin extensibility for out-of-tree hardware backends
#3239
opened Apr 29, 2026 by
yuchenjiangyj
Draft
3 of 5 tasks
[CI][Perf] Add high-load stress phase for Qwen3-TTS daily perf
#3238
opened Apr 29, 2026 by
linyueqian
Collaborator
4 tasks
glm-image: fix(npu)per-stage runtime env for HCCL ports + GLM-Image NPU stage config
#3235
opened Apr 29, 2026 by
lyj-jjj
Contributor
[Docs] Consolidate per-model TTS examples into a single hub
#3234
opened Apr 29, 2026 by
linyueqian
Collaborator
[Bugfix] The diffusion config system blocked other i2i models from accepting multiple input images
#3233
opened Apr 29, 2026 by
RuixiangMa
Contributor
5 tasks
[Rebase] Rebase to vllm 0.20.0
Labels: high priority, merge-test, nightly-test, ready
#3232
opened Apr 29, 2026 by
tzhouam
Collaborator
3 of 5 tasks
[BugFix] moss_tts_nano: eager-init lm + audio_tokenizer in __init__ so load_format: dummy works
#3230
opened Apr 28, 2026 by
leohuang257
Contributor
3 of 5 tasks
[BugFix] voxcpm2: eager-init tts_model in __init__ so load_format: dummy works
#3229
opened Apr 28, 2026 by
leohuang257
Contributor
3 of 5 tasks
[Cleanup] Use tokens_input() for TTS prompt construction
#3227
opened Apr 28, 2026 by
NickCao
Contributor
2 of 5 tasks
[Perf] Raise max_num_seqs for Fish Speech and MiMo Audio decoder stages
#3226
opened Apr 28, 2026 by
NickCao
Contributor
3 of 5 tasks
[AMD][CI][Bugfix] Fix "simple unit test"
#3225
opened Apr 28, 2026 by
tjtanaa
Contributor
5 tasks
[CPU_OFFLOAD] offload transformer after switch to transformer-2
#3224
opened Apr 28, 2026 by
xuechendi
Contributor
5 tasks
[Bugfix] Fix Qwen3 Omni AttributeError on processor loading with transformers<5.0
#3223
opened Apr 28, 2026 by
lvliang-intel
Contributor
5 tasks
[Model] Add unified Qwen3-TTS model definition and Triton serving example with TensorRT codec
#3221
opened Apr 28, 2026 by
vklimkov-nvidia