[Feature] model_runner refactor #4764
base: main
Conversation
Signed-off-by: zhenwenqi2024 <[email protected]>
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request introduces a significant refactoring to align the vllm-ascend codebase with the upstream vLLM v1 API, particularly in the worker components. Key changes include refactoring InputBatch and BlockTable to inherit from their upstream counterparts, which reduces code duplication and improves maintainability. Additionally, it centralizes the management of Ascend-specific resources like mc2_mask, cos, and sin using global variables, moving away from passing them as parameters. The speculative decoding method deepseek_mtp has been generalized to mtp. Overall, these changes improve code structure and alignment with the main vLLM repository. My review has identified one high-severity issue related to an incorrect type hint that should be addressed.
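The inheritance pattern the review describes, refactoring Ascend classes to extend their upstream vLLM counterparts rather than duplicate them, can be sketched as follows. This is an illustrative sketch only; the class and attribute names below are hypothetical stand-ins, not the actual vllm-ascend or vLLM code.

```python
# Sketch of the refactoring pattern from the review: an Ascend-specific
# class inherits from its upstream counterpart instead of reimplementing it.
# All names here are hypothetical, not the real vLLM/vllm-ascend API.

class InputBatch:  # stand-in for the upstream vLLM class
    def __init__(self, max_num_reqs: int):
        self.max_num_reqs = max_num_reqs
        self.req_ids: list[str] = []

class AscendInputBatch(InputBatch):
    """Adds only the Ascend-specific state on top of the upstream batch."""
    def __init__(self, max_num_reqs: int):
        super().__init__(max_num_reqs)
        # hypothetical extra per-request buffer needed only on NPU
        self.npu_only_buffer = [0] * max_num_reqs

batch = AscendInputBatch(8)
print(batch.max_num_reqs)  # 8
```

Because the subclass reuses the upstream constructor and fields, upstream fixes flow in automatically, which is the maintainability gain the review calls out.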
_mc2_tokens_capacity: Optional[int] = None
_reserved_mc2_mask: Optional[int] = None
The type hint for _reserved_mc2_mask is Optional[int], but it is assigned a torch.Tensor of dtype=torch.bool in the set_mc2_mask function. This should be corrected to Optional[torch.Tensor] to accurately reflect the variable's type and prevent potential bugs.
Suggested change:
- _reserved_mc2_mask: Optional[int] = None
+ _reserved_mc2_mask: Optional[torch.Tensor] = None
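A minimal sketch of the corrected pattern: a module-level cache whose annotation matches what the setter actually stores. The names are modeled on the review comment but hypothetical; a `list` of booleans stands in for the `torch.Tensor` (`dtype=torch.bool`) so the sketch runs without torch, and in the real code the annotation would be `Optional[torch.Tensor]`.

```python
from typing import Optional

# Hypothetical module-level cache, modeled on the review comment.
# In the real code this holds a torch.Tensor of dtype=torch.bool, so its
# annotation must name the tensor type, not int. A list stands in here.
_reserved_mc2_mask: Optional[list] = None

def set_mc2_mask(capacity: int) -> None:
    """Reserve the mask once at the given token capacity (no-op after that)."""
    global _reserved_mc2_mask
    if _reserved_mc2_mask is None:
        _reserved_mc2_mask = [False] * capacity

def get_mc2_mask() -> Optional[list]:
    return _reserved_mc2_mask

set_mc2_mask(4)
print(get_mc2_mask())  # [False, False, False, False]
```

Keeping the annotation in sync with the stored value lets static checkers catch misuse, which is exactly the class of bug the reviewer flags.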
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Typo in title and content: reactor/reafactor -> refactor.
What this PR does / why we need it?
Refactors the NPU model runner to align more closely with vLLM's GPU model runner.
Does this PR introduce any user-facing change?
No.
How was this patch tested?