Skip to content

Pull requests: ROCm/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add aiter fusion pattern for sequence parallel
#781 opened Oct 31, 2025 by zhuyuhua-v Draft
5 tasks
[MHA] add mha dispatch logic
#776 opened Oct 30, 2025 by gbyu-amd Loading…
5 tasks
Use UNIFORM QueryLen MLA for MTP
#773 opened Oct 29, 2025 by ZhiweiYan-96 Loading…
5 tasks
[feat](eplb): support eplb on rocm platform
#770 opened Oct 28, 2025 by PerryZhang01 Loading…
5 tasks
Update dev docker docs to remove old content
#769 opened Oct 27, 2025 by Rohan138 Loading…
5 tasks
Streaming logic for fused_exports and MOE
#768 opened Oct 27, 2025 by omuhamma Draft
5 tasks
Create determinism.md
#760 opened Oct 23, 2025 by shajrawi Loading…
[WIP] Support persistent MLA for ROCm MLA backend
#739 opened Oct 16, 2025 by ganyi1996ppo Loading…
5 tasks
Support fp8 with static scales
#725 opened Oct 3, 2025 by lburzawa Loading…
5 tasks
Quick port of fp4 fusedmoe
#724 opened Sep 30, 2025 by jpvillam-amd Loading…
Add dispatch for different mha backend
#722 opened Sep 29, 2025 by zhuyuhua-v Draft
5 tasks
Fix attn bug in qwen3-8b benchmark test
#721 opened Sep 28, 2025 by PerryZhang01 Loading…
5 tasks
update aiter fused_moe interface
#720 opened Sep 28, 2025 by zhiding512 Loading…
[Perf] refactor attention backend for perf boost
#713 opened Sep 26, 2025 by ganyi1996ppo Loading…
5 tasks
add hipblas in Docker build
#708 opened Sep 25, 2025 by dllehr-amd Loading…
5 tasks
[ROCm] Add allreduce dispatcher for ROCm device
#704 opened Sep 24, 2025 by zejunchen-zejun Loading…
Qwen-next script
#702 opened Sep 24, 2025 by ZhiweiYan-96 Loading…
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.