Conversation

@shenchuxiaofugui (Contributor) commented Dec 11, 2025

What this PR does / why we need it?

The fused alltoall operator was not designed or implemented to handle tensors passed as lists, but the weights used for dynamic load balancing are in list form. We therefore disable this operator when dynamic load balancing is enabled.
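As an illustration of the mismatch, the sketch below shows a fused-style wrapper that only accepts a plain tensor while dynamic load balancing hands the weights over as a list. All names here are made up for the illustration and are not the actual vllm-ascend interface.

```python
# Illustrative only: the fused alltoall path expects plain tensors, while
# dynamic EPLB keeps the expert weights as a Python list of tensors.
# Every name below is hypothetical, not the real vllm-ascend API.
import torch


def fused_alltoall_ffn(hidden_states: torch.Tensor, w13: torch.Tensor) -> None:
    # A fused kernel consumes a single weight tensor; there is no branch for lists.
    if not isinstance(w13, torch.Tensor):
        raise TypeError(f"fused alltoall expects a Tensor, got {type(w13).__name__}")


# With dynamic load balancing enabled, the weights arrive in list form,
# so the fused operator must be skipped in favor of the plain alltoall path.
w13_dynamic = [torch.empty(8, 16), torch.empty(8, 16)]

try:
    fused_alltoall_ffn(torch.empty(4, 16), w13_dynamic)  # type: ignore[arg-type]
except TypeError as err:
    print(err)
```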

Does this PR introduce any user-facing change?

No

How was this patch tested?

After the fix, the service was restarted and a chat conversation completed normally. A sample response:
{"id":"","object":"chat.completion","created":,"model":"dsr1","choices":[{"index":0,"message":{"role":"assistant","content":"\nOkay, the user is asking "What is deep learning?" Hmm, this seems like a fundamental question about AI. They might be a complete beginner or someone with some tech background looking to","refusal":null,"annotations":null,"audio":null,"function_call":null,"tool_calls":[],"reasoning":null,"reasoning_content":null},"logprobs":null,"finish_reason":"length","stop_reason":null,"token_ids":null}],"service_tier":null,"system_fingerprint":null,"usage":{"prompt_tokens":8,"total_tokens":48,"completion_tokens":40,"prompt_tokens_details":null},"prompt_logprobs":null,"prompt_token_ids":null,"kv_transfer_params":null}

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request addresses a bug where the dynamic expert parallelism load balancer (dynamic_eplb) was incorrectly used with the FUSED_ALLTOALL MoE communication method. The FUSED_ALLTOALL method, which relies on the highly optimized dispatch_ffn_combine kernel, does not support the dynamic expert layout changes that dynamic_eplb introduces. The fix correctly disables FUSED_ALLTOALL when dynamic_eplb is enabled, falling back to the compatible ALLTOALL method. The change is implemented cleanly by introducing a fused_all2all_enable variable, which improves code readability. The fix is correct and necessary for the proper functioning of dynamic load balancing.
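A minimal sketch of the fallback described above, assuming a small helper that picks the MoE communication method; only dynamic_eplb, fused_all2all_enable, FUSED_ALLTOALL, and ALLTOALL are taken from the review, while the surrounding structure is hypothetical.

```python
# Sketch of the selection logic, not the actual vllm-ascend code: FUSED_ALLTOALL
# is only chosen when dynamic EPLB is off; otherwise fall back to ALLTOALL.
from enum import Enum


class MoECommMethod(Enum):
    ALLTOALL = "alltoall"
    FUSED_ALLTOALL = "fused_alltoall"


def select_moe_comm_method(fused_supported: bool, dynamic_eplb: bool) -> MoECommMethod:
    # The fused dispatch_ffn_combine kernel cannot consume the list-form weights
    # produced by dynamic load balancing, so it is gated on eplb being disabled.
    fused_all2all_enable = fused_supported and not dynamic_eplb
    return (MoECommMethod.FUSED_ALLTOALL
            if fused_all2all_enable else MoECommMethod.ALLTOALL)


# With dynamic_eplb enabled, the helper always falls back to the plain method.
assert select_moe_comm_method(True, dynamic_eplb=True) is MoECommMethod.ALLTOALL
```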

@github-actions

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Fill in the PR description and write a clear commit message to help reviewers and future developers understand the change.

If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.

@shenchuxiaofugui force-pushed the ffn_fused branch 2 times, most recently from 884d60f to ce3074d, on December 11, 2025 11:17
@MengqingCao added the ready and ready-for-test labels on Dec 12, 2025
@github-actions

This pull request has conflicts; please resolve them before we can evaluate it.
