forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 50
Pull requests: ROCm/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WA][ROCm][DP] have a short-term WA for hip graph crash when capturing unsupported runtime op under DP+EP scenario
#784
opened Nov 3, 2025 by
zejunchen-zejun
Loading…
[Triton] 355 wip Llama FP4 triton fusion + TP8 triton decode shape tunning
#783
opened Oct 31, 2025 by
k50112113
Loading…
add aiter fusion pattern for sequence parallel
#781
opened Oct 31, 2025 by
zhuyuhua-v
•
Draft
5 tasks
[feat](eplb): support eplb on rocm platform
#770
opened Oct 28, 2025 by
PerryZhang01
Loading…
5 tasks
[ROCM] Llama4 VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE support
#763
opened Oct 24, 2025 by
tpopp
Loading…
[WIP] Support persistent MLA for ROCm MLA backend
#739
opened Oct 16, 2025 by
ganyi1996ppo
Loading…
5 tasks
[Perf] refactor attention backend for perf boost
#713
opened Sep 26, 2025 by
ganyi1996ppo
Loading…
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705
opened Sep 24, 2025 by
xytpai
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#704
opened Sep 24, 2025 by
zejunchen-zejun
Loading…
[ROCm] Add allreduce dispatcher for ROCm device
#695
opened Sep 18, 2025 by
zejunchen-zejun
Loading…
[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)
#694
opened Sep 18, 2025 by
xudonlyu
Loading…
[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator
#669
opened Sep 11, 2025 by
xytpai
Loading…
support ck-tile fused bias gemm for rocm unquantized gemm
#668
opened Sep 11, 2025 by
eliotwang
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.