[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & support async scheduling & refactor recompute_scheduler.py #4895

linfeng-yuan · 2025-12-10T15:04:33Z

What this PR does / why we need it?

Currently, the initialization and fundamental functions of RecomputeScheduler are broken with vLLM v0.12.0. This PR fixes the conflicts of RecomputeScheduler and refactor its implementations by inheriting original Scheduler of vLLM. Meanwhile, this PR also supports async cheduling with recompute scheduler by implementing AsyncRecomputeScheduler which is simply inherited AsncyScheduler of vLLM and RecomputeScheduler of vLLM-Ascend with python MRO.

Does this PR introduce any user-facing change?

No. The switch naming is the same as v0.11.0 : recompute_scheduler_enable

How was this patch tested?

E2E serving with 2P1D dsv3.1 passed. The performance was the same as original vllm scheduler with async_scheduling and preempted requests in D Nodes are successfully transfered to Proxy and further to P Node. This significantly improves the performance and robustness of PD disaggregation deployments.

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

The pull request introduces significant refactoring to the scheduler components, aiming to fix recompute issues, support async scheduling, and align with vLLM 0.12.0. A new AsyncRecomputeScheduler is added, and the core RecomputeScheduler is refactored to inherit from the base vLLM Scheduler class. However, there are critical issues related to class definitions and naming consistency that need to be addressed to ensure the code is functional.

gemini-code-assist · 2025-12-10T15:06:44Z

vllm_ascend/core/recompute_schedule_config.py

-    scheduler_cls: Union[str, Type[object]] = (
-        "vllm_ascend.core.recompute_scheduler.RecomputeScheduler")

    @classmethod


The @dataclass decorator and the class definition for RecomputeSchedulerConfig appear to be missing. This will lead to a syntax error or NameError when the initialize_from_config method is called, as it's currently defined as a standalone method without a class context. The class definition, likely inheriting from SchedulerConfig, needs to be restored.

@dataclass class RecomputeSchedulerConfig(SchedulerConfig): pass @classmethod

vllm_ascend/core/recompute_schedule_config.py

vllm_ascend/core/recompute_scheduler.py

vllm_ascend/core/async_recompute_scheduler.py

github-actions · 2025-12-10T16:12:17Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

…upport async scheduling & refactor recompute_scheduler.py Signed-off-by: linfeng-yuan <[email protected]>

Signed-off-by: linfeng-yuan <[email protected]>

linfeng-yuan · 2025-12-11T07:12:08Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces an AsyncRecomputeScheduler by inheriting from AsyncScheduler and RecomputeScheduler, and refactors the RecomputeScheduler to align with upstream vLLM changes. The RecomputeScheduler now inherits from vllm.v1.core.sched.scheduler.Scheduler, significantly reducing its __init__ method and moving much of its core logic to the base class. Key changes include updating RecomputeSchedulerConfig to accept VllmConfig and dynamically select between synchronous and asynchronous recompute schedulers, optimizing scheduling for async operations by avoiding extra steps, refining preemption logic to restore token budgets and related data, and adjusting speculative decoding token calculations. Additionally, the handling of encoder inputs and KV connector metadata is updated, and KV cache event collection and publishing are moved to the update_from_output method. The RecomputeSchedulerOutput dataclass now inherits from SchedulerOutput, simplifying its structure.

Signed-off-by: linfeng-yuan <[email protected]>

gemini-code-assist bot reviewed Dec 10, 2025

View reviewed changes

linfeng-yuan force-pushed the fix_and_refactor_recompute_scheduler branch 3 times, most recently from 08c4581 to b968591 Compare December 10, 2025 15:28

github-actions bot added the module:core label Dec 10, 2025

[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & s…

fce5b5d

…upport async scheduling & refactor recompute_scheduler.py Signed-off-by: linfeng-yuan <[email protected]>

linfeng-yuan force-pushed the fix_and_refactor_recompute_scheduler branch from b968591 to fce5b5d Compare December 10, 2025 18:31

fix lint

c995b55

Signed-off-by: linfeng-yuan <[email protected]>

gemini-code-assist bot reviewed Dec 11, 2025

View reviewed changes

fix mypy check

a78a835

Signed-off-by: linfeng-yuan <[email protected]>

linfeng-yuan force-pushed the fix_and_refactor_recompute_scheduler branch from ad094a0 to a78a835 Compare December 11, 2025 09:07

linfeng-yuan added 2 commits December 11, 2025 18:05

merge three python files into one

82298e3

Signed-off-by: linfeng-yuan <[email protected]>

fix ci

90b77d3

Signed-off-by: linfeng-yuan <[email protected]>

linfeng-yuan force-pushed the fix_and_refactor_recompute_scheduler branch from adcc6cf to 90b77d3 Compare December 11, 2025 11:30

fix ut error caused by file moving

285673c

Signed-off-by: linfeng-yuan <[email protected]>

wangxiyuan approved these changes Dec 11, 2025

View reviewed changes

wangxiyuan merged commit 0fbe083 into vllm-project:main Dec 11, 2025
25 checks passed

github-actions bot added the module:tests label Dec 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & support async scheduling & refactor recompute_scheduler.py #4895

[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & support async scheduling & refactor recompute_scheduler.py #4895

linfeng-yuan commented Dec 10, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

linfeng-yuan commented Dec 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & support async scheduling & refactor recompute_scheduler.py #4895

[bugfix][refactor] fix recompute_scheduler break with vllm 0.12.0 & support async scheduling & refactor recompute_scheduler.py #4895

Conversation

linfeng-yuan commented Dec 10, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

linfeng-yuan commented Dec 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

linfeng-yuan commented Dec 10, 2025 •

edited by github-actions bot

Loading