[Core] Encoder separation for Encode-Prefill-Decode Disaggregation #4176

amy-why-3459 · 2025-11-13T10:25:44Z

What this PR does / why we need it?

Encoder separation for Encode-Prefill-Decode Disaggregation

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.11.2
vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2

github-actions · 2025-11-13T10:25:52Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request introduces support for encode-prefill-decode disaggregation by separating the encoder execution path. Workers can now be designated as encoder producers, which exclusively run the multimodal encoder and transfer the resulting embeddings, bypassing the decoding process. The implementation correctly adds logic for producer and consumer roles. However, I've identified a critical bug where consumer ranks incorrectly re-execute the encoder, which overwrites the embeddings they just received. My review provides a necessary fix for this issue.

vllm_ascend/worker/model_runner_v1.py

ApsarasX · 2025-11-14T10:32:15Z

Are there any proxies that support EPD currently?

github-actions · 2025-11-24T09:10:42Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-11-26T03:52:09Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-12-01T11:06:29Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: amy-why-3459 <[email protected]>

MengqingCao · 2025-12-03T12:41:49Z

Thanks for this great work, do you have any plan to add e2e test for this feature?

amy-why-3459 · 2025-12-03T12:47:37Z

Thanks for this great work, do you have any plan to add e2e test for this feature?

We will add use cases as soon as possible.

…llm-project#4176) ### What this PR does / why we need it? Support Encoder separation for Encode-Prefill-Decode Disaggregation - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2 Signed-off-by: amy-why-3459 <[email protected]> Signed-off-by: Che Ruan <[email protected]>

…llm-project#4176) ### What this PR does / why we need it? Support Encoder separation for Encode-Prefill-Decode Disaggregation - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2 Signed-off-by: amy-why-3459 <[email protected]>

gemini-code-assist bot reviewed Nov 13, 2025

View reviewed changes

vllm_ascend/worker/model_runner_v1.py Show resolved Hide resolved

ApsarasX mentioned this pull request Nov 14, 2025

[RFC]: Encoder separation for Encode-Prefill-Decode Disaggregation #4115

Closed

zengchuang-hw mentioned this pull request Nov 17, 2025

[Task]: Encoder separation for vLLM-ascend JiusiServe/vllm-ascend#20

Open

github-actions bot added the merge-conflicts label Nov 24, 2025

amy-why-3459 force-pushed the epd_draft branch from 17f1100 to 63971b1 Compare November 24, 2025 11:14

github-actions bot added merge-conflicts and removed merge-conflicts labels Nov 24, 2025

amy-why-3459 force-pushed the epd_draft branch from 63971b1 to ad5aaa2 Compare November 26, 2025 04:00

github-actions bot removed the merge-conflicts label Nov 26, 2025

amy-why-3459 force-pushed the epd_draft branch 6 times, most recently from 170adfc to 073df5a Compare November 26, 2025 14:35

amy-why-3459 changed the title ~~Encoder separation for Encode-Prefill-Decode Disaggregation~~ [Core] Encoder separation for Encode-Prefill-Decode Disaggregation Nov 27, 2025

wangxiyuan approved these changes Nov 27, 2025

View reviewed changes

wangxiyuan added ready read for review ready-for-test start test by label for PR labels Nov 27, 2025

amy-why-3459 force-pushed the epd_draft branch 4 times, most recently from 92c5ecc to adce4f1 Compare November 28, 2025 10:25

github-actions bot added the merge-conflicts label Dec 1, 2025

Encoder separation for Encode-Prefill-Decode Disaggregation

85306a9

Signed-off-by: amy-why-3459 <[email protected]>

amy-why-3459 force-pushed the epd_draft branch from adce4f1 to 85306a9 Compare December 1, 2025 12:21

github-actions bot removed the merge-conflicts label Dec 1, 2025

MengqingCao merged commit 26e8e58 into vllm-project:main Dec 3, 2025
22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation #4176

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation #4176

amy-why-3459 commented Nov 13, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Nov 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

ApsarasX commented Nov 14, 2025

Uh oh!

github-actions bot commented Nov 24, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

github-actions bot commented Dec 1, 2025

Uh oh!

MengqingCao commented Dec 3, 2025

Uh oh!

amy-why-3459 commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation #4176

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation #4176

Conversation

amy-why-3459 commented Nov 13, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Nov 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ApsarasX commented Nov 14, 2025

Uh oh!

github-actions bot commented Nov 24, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

github-actions bot commented Dec 1, 2025

Uh oh!

MengqingCao commented Dec 3, 2025

Uh oh!

amy-why-3459 commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

amy-why-3459 commented Nov 13, 2025 •

edited by github-actions bot

Loading