Pull requests: vllm-project/vllm-gaudi


Pull requests list

Doc changes from main to 0.11.2 [documentation] [skip-gaudi-tests]
#655 opened Nov 27, 2025 by mhelf-intel
Enabling qwen3-0.6b server/benchmark on a docker
#654 opened Nov 27, 2025 by PatrykWo
Fix for links in docker md [documentation] [skip-gaudi-tests]
#653 opened Nov 27, 2025 by PatrykWo
Trigger CI test to verify VLLM NIXL functionality
#649 opened Nov 27, 2025 by amathewc
platform: optimize grouped topk op
#647 opened Nov 27, 2025 by xinyu-intel
make mla weight contiguous
#646 opened Nov 27, 2025 by xinyu-intel
bucket: add query len 1 to prefill bucket
#645 opened Nov 27, 2025 by xinyu-intel
Hybrid KV cache for hpu
#644 opened Nov 26, 2025 by michalkuligowski (Draft)
Removing external links from the main page [documentation] [skip-gaudi-tests]
#638 opened Nov 26, 2025 by PatrykWo
Fix filter for edge case & prefill bs > 1
#634 opened Nov 26, 2025 by adobrzyn
Fix LoRA tests
#630 opened Nov 25, 2025 by vivekgoe
Spec decode warmup support
#624 opened Nov 25, 2025 by jerrychenhf
Fix environment setup for FP8
#623 opened Nov 25, 2025 by yiliu30
enable spec decode for Unified Attention, part1
#619 opened Nov 21, 2025 by xuechendi
lora, fix for PR28545
#617 opened Nov 21, 2025 by iboiko-habana
Sleep mode support
#584 opened Nov 18, 2025 by Kacper-Pietkun