Pull requests: ROCm/vllm

Pull requests list

[CI] temp reduce the gpu mem utilization
#810 opened Nov 14, 2025 by zejunchen-zejun
Fix A16W4 shuffle weight and scale for aiter/main
#808 opened Nov 13, 2025 by Rohan138
5 tasks
[CommFusion] Add allreduce+rmsnorm fusion kernel
#803 opened Nov 10, 2025 by xytpai Draft
5 tasks
enable sequence parallel for rocm
#790 opened Nov 5, 2025 by zhuyuhua-v
Add Fused RMSNorm + FP8 Per-tensor Static Quantization to Llama 3 Models
#789 opened Nov 4, 2025 by farlukas
3 tasks done
add aiter fusion pattern for sequence parallel
#781 opened Oct 31, 2025 by zhuyuhua-v Draft
5 tasks
[MHA] add mha dispatch logic
#776 opened Oct 30, 2025 by gbyu-amd
5 tasks
Use UNIFORM QueryLen MLA for MTP
#773 opened Oct 29, 2025 by ZhiweiYan-96
5 tasks
[feat](eplb): support eplb on rocm platform
#770 opened Oct 28, 2025 by PerryZhang01
5 tasks
Streaming logic for fused_exports and MOE
#768 opened Oct 27, 2025 by omuhamma Draft
5 tasks
Create determinism.md
#760 opened Oct 23, 2025 by shajrawi
[WIP] Support persistent MLA for ROCm MLA backend
#739 opened Oct 16, 2025 by ganyi1996ppo
5 tasks
Quick port of fp4 fusedmoe
#724 opened Sep 30, 2025 by jpvillam-amd
Add dispatch for different mha backend
#722 opened Sep 29, 2025 by zhuyuhua-v Draft
5 tasks
Fix attn bug in qwen3-8b benchmark test
#721 opened Sep 28, 2025 by PerryZhang01
5 tasks
update aiter fused_moe interface
#720 opened Sep 28, 2025 by zhiding512
[Perf] refactor attention backend for perf boost
#713 opened Sep 26, 2025 by ganyi1996ppo
5 tasks
add hipblas in Docker build
#708 opened Sep 25, 2025 by dllehr-amd
5 tasks