forked from vllm-project/vllm
Pull requests: ROCm/vllm
Pull requests list
Use gfx11-ci Docker container for wheel build and kernel tests
#873
opened Apr 14, 2026 by
mgehre-amd
1 of 3 tasks
Add gfx12 (RDNA4) Triton tile-size heuristic for W4A16 prefill kernel
#870
opened Apr 13, 2026 by
amd-xavierwang
3 of 5 tasks
Remove HipW4A16SkinnyLinearKernel and no-group wvSplitK_int4
#869
opened Apr 13, 2026 by
mgehre-amd
Enable FLASH_ATTN backend with upstream flash-attn CK on ROCm
#866
opened Apr 10, 2026 by
mgehre-amd
Draft
1 task
[CI/Build] Fix Dockerfile.rocm_base image build for ROCm 7.2
bug
Something isn't working
#863
opened Feb 23, 2026 by
jbelloncastro
3 of 5 tasks
Add silu-and-mul and per-token dynamic FP8 quant fusion
stale
#852
opened Jan 6, 2026 by
kliuae-amd
5 tasks
[Triton] fix rope kv_cache for RocmAiterUnifiedAttentionImpl + v.contiguous() remove
stale
#851
opened Jan 2, 2026 by
k50112113
[rocm]use aiter triton kernel as triton mha fallback path
#809
opened Nov 14, 2025 by
zhuyuhua-v
Draft
add aiter fusion pattern for sequence parallel
#781
opened Oct 31, 2025 by
zhuyuhua-v
Draft
5 tasks