drop ascend scheduler #4498

wangxiyuan · 2025-11-27T08:00:45Z

The Ascend scheduler was originally added for the non-chunked-prefill case, because the NPU ops did not work well with chunked prefill.

Now that the ops work better with chunked prefill, it's time to remove the Ascend scheduler and use the default vLLM scheduler.

vLLM version: v0.11.2
vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2
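
For readers updating their own launch scripts after this removal, here is a minimal sketch of the config cleanup it implies. It assumes the scheduler was enabled through an `ascend_scheduler_config` entry in the `additional_config` dict; that key name, the `other_option` placeholder, and the helper `drop_ascend_scheduler` are illustrative assumptions, not taken from this PR.

```python
from copy import deepcopy

# Assumed key name, used for illustration only; not confirmed by this PR thread.
ASCEND_SCHEDULER_KEY = "ascend_scheduler_config"


def drop_ascend_scheduler(additional_config: dict) -> dict:
    """Return a copy of the config without the Ascend scheduler entry."""
    cleaned = deepcopy(additional_config)
    # pop() with a default is a no-op when the key is already absent.
    cleaned.pop(ASCEND_SCHEDULER_KEY, None)
    return cleaned


old_config = {
    "ascend_scheduler_config": {"enabled": True},
    "other_option": 1,  # placeholder for any unrelated setting
}
new_config = drop_ascend_scheduler(old_config)
```

With the entry gone, scheduling falls through to whatever the default vLLM scheduler does, which per this thread includes chunked prefill being enabled by default.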

github-actions · 2025-11-27T08:00:54Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

gemini-code-assist

Code Review

This pull request removes the ascend_scheduler feature. The changes are consistent across documentation, examples, tests, and source code, reflecting the removal of this functionality. However, I've identified a critical issue in one of the test files where the modification of server arguments has resulted in a malformed command, which will likely cause the test to fail. I've provided a suggestion to fix it.

tests/e2e/nightly/models/test_deepseek_v3_2_exp_w8a8.py

github-actions · 2025-11-27T09:48:34Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

wangxiyuan · 2025-11-27T12:04:19Z

/gemini review

gemini-code-assist

Code Review

This pull request removes the custom Ascend scheduler and related configurations, falling back to the default vLLM scheduler. This is a significant cleanup, touching documentation, tests, examples, and source code. The changes are mostly deletions and appear correct and consistent with the PR's goal. However, I found an issue in the documentation for TorchAirGraph. The examples are now incorrect because they enable TorchAirGraph without disabling chunked prefill, which is not a supported combination. This will lead to runtime errors for users following the guide. I've provided suggestions to fix the examples.

docs/source/user_guide/feature_guide/graph_mode.md

gemini-code-assist

Code Review

This pull request removes the ascend_scheduler and related configurations, aiming to use the default vLLM scheduler. The changes are extensive, touching documentation, examples, tests, and core logic to remove all traces of the ascend_scheduler. The refactoring is clean and aligns with the stated objective.

My main concerns are a potential change in default behavior for chunked prefill, and the removal of tests for SchedulerDynamicBatch which appears to still be in use. I've added critical and high severity comments regarding these issues.

wangxiyuan · 2025-11-28T00:56:55Z

CI passed here: https://github.com/vllm-project/vllm-ascend/actions/runs/19735428628/job/56546065316?pr=4498

The qwen3-next test failed because chunked prefill is now enabled by default, and qwen3-next with MTP does not work with it yet. Let's skip that test for now.

github-actions · 2025-11-28T07:21:04Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

wangxiyuan · 2025-11-28T07:41:28Z

CI: https://github.com/vllm-project/vllm-ascend/actions/runs/19751104052/job/56594182797?pr=4498

whx-sjtu

I have confirmed that no multi-modal models depend on AscendScheduler anymore. LGTM.

github-actions bot added the documentation (Improvements or additions to documentation), module:tests, and module:core labels Nov 27, 2025

gemini-code-assist bot reviewed Nov 27, 2025

tests/e2e/nightly/models/test_deepseek_v3_2_exp_w8a8.py (resolved)

wangxiyuan force-pushed the remove_ascend_scheduler branch from d896281 to 153f779 Compare November 27, 2025 08:39

wangxiyuan added the ready (read for review) and ready-for-test (start test by label for PR) labels Nov 27, 2025

github-actions bot added the merge-conflicts label Nov 27, 2025

wangxiyuan force-pushed the remove_ascend_scheduler branch from 153f779 to dc5637d Compare November 27, 2025 11:55

github-actions bot removed the merge-conflicts label Nov 27, 2025

gemini-code-assist bot reviewed Nov 27, 2025

docs/source/user_guide/feature_guide/graph_mode.md (resolved)

docs/source/user_guide/feature_guide/graph_mode.md (resolved)

gemini-code-assist bot reviewed Nov 27, 2025

wangxiyuan force-pushed the remove_ascend_scheduler branch from dc5637d to 9b3d1b5 Compare November 28, 2025 01:00

menogrey mentioned this pull request Nov 28, 2025

[Doc] Refactor the DeepSeek-V3.1 tutorial. #4399

Merged

github-actions bot added the merge-conflicts label Nov 28, 2025

wangxiyuan force-pushed the remove_ascend_scheduler branch from 9b3d1b5 to 38eb2e9 Compare November 28, 2025 08:23

github-actions bot removed the merge-conflicts label Nov 28, 2025

MengqingCao mentioned this pull request Nov 29, 2025

[FixCI] Enable chunked prefill for auto-prefix-caching test #4551

Closed

whx-sjtu approved these changes Nov 29, 2025

wangxiyuan added 2 commits November 29, 2025 12:02

drop ascend scheduler

96c2a5c

Signed-off-by: wangxiyuan <[email protected]>

skip qwen3-next + mtp test

8db5756

Signed-off-by: wangxiyuan <[email protected]>

wangxiyuan force-pushed the remove_ascend_scheduler branch from 38eb2e9 to 8db5756 Compare November 29, 2025 04:25

menogrey mentioned this pull request Nov 29, 2025

add deepseek-r1-w8a8 tutorial. #4504

Closed

Yikun approved these changes Nov 29, 2025

wangxiyuan merged commit f10acdd into vllm-project:main Nov 29, 2025
21 of 22 checks passed

MengqingCao added a commit that referenced this pull request Nov 29, 2025

Revert "drop ascend scheduler (#4498)"

eb61f91

This reverts commit f10acdd.

MengqingCao mentioned this pull request Nov 29, 2025

Revert "drop ascend scheduler" #4580

Merged

MengqingCao added a commit that referenced this pull request Nov 29, 2025

Revert "drop ascend scheduler" (#4580)

517fd92

Reverts #4498 - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2

ChenCangtao pushed a commit to ChenCangtao/vllm-ascend that referenced this pull request Dec 3, 2025

Revert "drop ascend scheduler" (vllm-project#4580)

6d07cbe

Reverts vllm-project#4498 - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2

wangxiyuan deleted the remove_ascend_scheduler branch December 4, 2025 07:04
