Skip to content

Commit a434a2f

Browse files
committed
fix(scheduler): allow hybrid chunked prefill to chunk requests
1 parent d71bde4 commit a434a2f

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

vllm/v1/core/sched/scheduler.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -474,6 +474,7 @@ def schedule(self) -> SchedulerOutput:
474474
# pooling requests to be chunked
475475
if (
476476
not self.scheduler_config.chunked_prefill_enabled
477+
and not self.scheduler_config.enable_hybrid_chunked_prefill
477478
and num_new_tokens > token_budget
478479
):
479480
self.waiting.pop_request()

0 commit comments

Comments
 (0)