Fix issue with async_scheduling when dealing with chunked input #359
base: v0.10.2_next
Conversation
🚧 CI Blocked: the main CI workflow was not started.
Signed-off-by: Tianmu Li <[email protected]>
Force-pushed from 1f87a51 to 09c183d
🚧 CI Blocked: the main CI workflow was not started.
Signed-off-by: Tianmu Li <[email protected]>
Signed-off-by: Tianmu Li <[email protected]>
@afierka-intel Can you check if this fixes the issue with long context? It works in my tests.
/run-gaudi-tests
/run-gaudi-tests
Please cherry-pick to 0.11 also.
/run-gaudi-tests
✅ CI Passed: all checks passed successfully.
/run-gaudi-tests
/run-gaudi-tests
✅ CI Passed: all checks passed successfully.
/run-gaudi-tests
✅ CI Passed: all checks passed successfully.
When a prompt is chunked (input sequence length > max_num_batched_tokens), a scheduling step may produce no output token for that request, while async scheduling still expects one. This PR addresses the mismatch and aligns the behavior with gpu_model_runner.
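
To illustrate the idea, here is a minimal sketch, not the actual patch: the names `pad_sampled_tokens`, `PLACEHOLDER_TOKEN_ID`, and the dict-shaped sampler output are all assumptions. The point it demonstrates is the one described above: during chunked prefill, a request whose prompt is not yet fully consumed yields no sampled token, so the runner pads that request's slot with a placeholder to keep the scheduler's one-entry-per-request expectation intact, mirroring what gpu_model_runner does.

```python
# Hypothetical sketch of the fix's core idea (names are assumptions, not vLLM API).
from typing import Dict, List

# Assumption: a sentinel id the scheduler treats as "no token produced this step".
PLACEHOLDER_TOKEN_ID = -1


def pad_sampled_tokens(
    scheduled_req_ids: List[str],
    sampled: Dict[str, int],  # req_id -> token id, only for fully prefilled requests
) -> List[int]:
    """Return exactly one token id per scheduled request, in scheduling order.

    Requests still mid-prompt (chunked input longer than
    max_num_batched_tokens) have no sampled token yet, so they get the
    placeholder instead. This keeps the async scheduler's bookkeeping
    aligned with the model runner's actual output.
    """
    return [sampled.get(req_id, PLACEHOLDER_TOKEN_ID) for req_id in scheduled_req_ids]


# Example: req-b's long prompt is still being processed in chunks,
# so only req-a produced a token this step.
print(pad_sampled_tokens(["req-a", "req-b"], {"req-a": 42}))  # [42, -1]
```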