forked from vllm-project/vllm
Pull requests: ROCm/vllm
[rocm]use aiter triton kernel as triton mha fallback path
#809
opened Nov 14, 2025 by
zhuyuhua-v
Draft
Fix A16W4 shuffle weight and scale for aiter/main
#808
opened Nov 13, 2025 by
Rohan138
5 tasks
Update ds script to support large block_size, safetensor and async-scheduling
#800
opened Nov 8, 2025 by
wuhuikx
[DO NOT MERGE]Enable FP4 bmm for k_up_proj and v_up_proj in MLA
#797
opened Nov 7, 2025 by
ZhiweiYan-96
5 tasks
Add Fused RMSNorm + FP8 Per-tensor Static Quantization to Llama 3 Models
#789
opened Nov 4, 2025 by
farlukas
3 tasks done
[Triton] add a16w8 gemm for DS-R1 for o_proj for decode, add rocm_aiter_triton…
#788
opened Nov 4, 2025 by
k50112113
[Triton] 355 wip Llama FP4 triton fusion + TP8 triton decode shape tunning
#783
opened Oct 31, 2025 by
k50112113
add aiter fusion pattern for sequence parallel
#781
opened Oct 31, 2025 by
zhuyuhua-v
Draft
5 tasks
[feat](eplb): support eplb on rocm platform
#770
opened Oct 28, 2025 by
PerryZhang01
5 tasks
[ROCM] Llama4 VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE support
#763
opened Oct 24, 2025 by
tpopp
[WIP] Support persistent MLA for ROCm MLA backend
#739
opened Oct 16, 2025 by
ganyi1996ppo
5 tasks
[Perf] refactor attention backend for perf boost
#713
opened Sep 26, 2025 by
ganyi1996ppo
5 tasks
[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern
#705
opened Sep 24, 2025 by
xytpai