ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 50
Star 108

Code
Issues 5
Pull requests 48
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/vllm

Labels 14 Milestones 0

New pull request New

48 Open 734 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[CI] temp reduce the gpu mem utilization

#810 opened Nov 14, 2025 by zejunchen-zejun

Loading…

[rocm]use aiter triton kernel as triton mha fallback path

#809 opened Nov 14, 2025 by zhuyuhua-v • Draft

Fix A16W4 shuffle weight and scale for aiter/main

#808 opened Nov 13, 2025 by Rohan138

Loading…

5 tasks

[CommFusion] Add allreduce+rmsnorm fusion kernel

#803 opened Nov 10, 2025 by xytpai • Draft

5 tasks

Update ds script to support large block_size, safetensor and async-scheduling

#800 opened Nov 8, 2025 by wuhuikx

Loading…

[DO NOT MERGE]Enable FP4 bmm for k_up_proj and v_up_proj in MLA

#797 opened Nov 7, 2025 by ZhiweiYan-96

Loading…

5 tasks

enable sequence parallel for rocm

#790 opened Nov 5, 2025 by zhuyuhua-v

Loading…

Add Fused RMSNorm + FP8 Per-tensor Static Quantization to Llama 3 Models

#789 opened Nov 4, 2025 by farlukas

Loading…

3 tasks done

[Triton] add a16w8 gemm for DS-R1 for o_proj for decode, add rocm_aiter_triton…

#788 opened Nov 4, 2025 by k50112113

Loading…

[Triton] 355 wip Llama FP4 triton fusion + TP8 triton decode shape tunning

#783 opened Oct 31, 2025 by k50112113

Loading…

add aiter fusion pattern for sequence parallel

#781 opened Oct 31, 2025 by zhuyuhua-v • Draft

5 tasks

[MHA] add mha dispatch logic

#776 opened Oct 30, 2025 by gbyu-amd

Loading…

5 tasks

Use UNIFORM QueryLen MLA for MTP

#773 opened Oct 29, 2025 by ZhiweiYan-96

Loading…

5 tasks

[feat](eplb): support eplb on rocm platform

#770 opened Oct 28, 2025 by PerryZhang01

Loading…

5 tasks

Streaming logic for fused_exports and MOE

#768 opened Oct 27, 2025 by omuhamma • Draft

5 tasks

[ROCM] Llama4 VLLM_ROCM_USE_AITER_TRITON_FUSED_ROPE_ZEROS_KV_CACHE support

#763 opened Oct 24, 2025 by tpopp

Loading…

Create determinism.md

#760 opened Oct 23, 2025 by shajrawi

Loading…

[WIP] Support persistent MLA for ROCm MLA backend

#739 opened Oct 16, 2025 by ganyi1996ppo

Loading…

5 tasks

Quick port of fp4 fusedmoe

#724 opened Sep 30, 2025 by jpvillam-amd

Loading…

Add dispatch for different mha backend

#722 opened Sep 29, 2025 by zhuyuhua-v • Draft

5 tasks

Fix attn bug in qwen3-8b benchmark test

#721 opened Sep 28, 2025 by PerryZhang01

Loading…

5 tasks

update aiter fused_moe interface

#720 opened Sep 28, 2025 by zhiding512

Loading…

[Perf] refactor attention backend for perf boost

#713 opened Sep 26, 2025 by ganyi1996ppo

Loading…

5 tasks

add hipblas in Docker build

#708 opened Sep 25, 2025 by dllehr-amd

Loading…

5 tasks

[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern

#705 opened Sep 24, 2025 by xytpai

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!