@kunpengW-code kunpengW-code commented Dec 1, 2025

What this PR does / why we need it?

Fix an issue where the Qwen3 MoE service fails to start after upgrading the vLLM version.

Error info:
AttributeError: 'AscendFusedMoE' object has no attribute 'use_dp_chunking'

Does this PR introduce any user-facing change?

no

How was this patch tested?

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request addresses a bug in the Qwen3 MoE quantization by correctly initializing the base class for AscendFusedMoEMethod. The change ensures that the FusedMoEMethodBase constructor is called with the appropriate moe_config from the layer.
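The initialization pattern described in this summary can be sketched with stand-in classes. Everything below is hypothetical and simplified: `MoEConfigSketch`, `LayerSketch`, `MethodBaseSketch`, and the two subclasses are illustrative names, not the real vLLM or vLLM Ascend types, and the attribute name `moe` is an assumption made for the example.

```python
from dataclasses import dataclass


@dataclass
class MoEConfigSketch:
    # Hypothetical stand-in for the config object carried by the layer.
    num_experts: int


class LayerSketch:
    # Hypothetical stand-in for a fused-MoE layer that owns a moe_config.
    def __init__(self, moe_config: MoEConfigSketch):
        self.moe_config = moe_config


class MethodBaseSketch:
    # Hypothetical stand-in for the base class: it stores the config it is given.
    def __init__(self, moe_config: MoEConfigSketch):
        self.moe = moe_config


class InitializedMethodSketch(MethodBaseSketch):
    def __init__(self, layer: LayerSketch):
        # Forward the layer's config to the base constructor so that every
        # attribute the base class sets up exists on this instance.
        super().__init__(layer.moe_config)


class UninitializedMethodSketch(MethodBaseSketch):
    def __init__(self, layer: LayerSketch):
        # The base constructor is skipped here, so self.moe is never set and
        # any later access to it raises AttributeError.
        self.layer = layer
```

In this sketch, only the subclass that calls `super().__init__` ends up with the `moe` attribute; the other one fails later, at the point where the attribute is first read.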

quant_config: AscendQuantConfig,
prefix: str,
packed_modules_mapping: Dict[str, Any],
layer: torch.nn.Module = None):

high

The layer parameter in the __init__ method is defined with a default value of None, but layer.moe_config is accessed unconditionally on the next line. This creates a potential AttributeError if AscendFusedMoEMethod is ever instantiated without providing a layer argument. Given that layer is always expected to be a FusedMoE instance when this method is called (as seen in AscendQuantConfig.get_quant_method), it should be made a required argument to reflect its actual usage and prevent potential runtime errors.

Suggested change
- layer: torch.nn.Module = None):
+ layer: torch.nn.Module):
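The difference between the two signatures can be illustrated with a minimal sketch. The names `LayerSketch`, `read_config_optional`, and `read_config_required` are hypothetical and exist only for this example; they are not part of vLLM Ascend.

```python
class LayerSketch:
    # Hypothetical stand-in for a layer object that carries a moe_config.
    moe_config = {"num_experts": 8}


def read_config_optional(layer=None):
    # Mirrors the original signature: the default is None, but the attribute
    # is read unconditionally, so omitting the argument fails here with
    # AttributeError ('NoneType' object has no attribute 'moe_config').
    return layer.moe_config


def read_config_required(layer):
    # Mirrors the suggested signature: omitting the argument fails earlier,
    # at the call itself, with a TypeError that names the missing parameter.
    return layer.moe_config
```

With a required parameter, the mistake surfaces at the call site rather than inside the constructor body, which tends to make the error message easier to act on.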

github-actions bot commented Dec 1, 2025

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Write the commit message by filling in the PR description, to help reviewers and future developers understand the change.

If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.

Signed-off-by: Wang Kunpeng <[email protected]>

github-actions bot commented Dec 1, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

…to qwen3-moe-bugfix

# Conflicts:
#	tests/e2e/multicard/test_data_parallel.py
@wangxiyuan added the ready (ready for review) and ready-for-test (start test by label for PR) labels Dec 1, 2025
@wangxiyuan wangxiyuan merged commit a9c4b86 into vllm-project:main Dec 1, 2025
47 checks passed
ChenCangtao pushed a commit to ChenCangtao/vllm-ascend that referenced this pull request Dec 3, 2025
### What this PR does / why we need it?
Fix an issue where the Qwen3 MoE service fails to start after
upgrading the vLLM version.

Error info:
AttributeError: 'AscendFusedMoE' object has no attribute 'use_dp_chunking'

### Does this PR introduce _any_ user-facing change?
no


- vLLM version: v0.11.2

---------

Signed-off-by: Wang Kunpeng <[email protected]>
layer: torch.nn.Module = None):
def __init__(self, quant_config: AscendQuantConfig, prefix: str,
packed_modules_mapping: Dict[str,
Any], layer: torch.nn.Module):

Suggest formatting it.
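One conventional layout for the signature above puts each parameter on its own line. This is only a sketch: the project's own formatter may prefer a different wrapping, `AscendQuantConfig` is replaced by an empty placeholder class, the class name `AscendFusedMoEMethodSketch` is hypothetical, and the constructor body (not shown in the quoted diff) is filled in with simple attribute assignments.

```python
from __future__ import annotations

from typing import TYPE_CHECKING, Any, Dict

if TYPE_CHECKING:
    import torch  # needed only by type checkers; annotations are not evaluated at runtime


class AscendQuantConfig:
    """Empty placeholder for the real config class (assumption for this sketch)."""


class AscendFusedMoEMethodSketch:
    def __init__(
        self,
        quant_config: AscendQuantConfig,
        prefix: str,
        packed_modules_mapping: Dict[str, Any],
        layer: torch.nn.Module,
    ):
        # Illustrative body only: the real constructor is not shown in the diff.
        self.quant_config = quant_config
        self.prefix = prefix
        self.packed_modules_mapping = packed_modules_mapping
        self.layer = layer
```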

Mercykid-bash pushed a commit to Mercykid-bash/vllm-ascend that referenced this pull request Dec 4, 2025
### What this PR does / why we need it?
Fix an issue where the Qwen3 MoE service fails to start after
upgrading the vLLM version.

Error info:
AttributeError: 'AscendFusedMoE' object has no attribute 'use_dp_chunking'

### Does this PR introduce _any_ user-facing change?
no

- vLLM version: v0.11.2

---------

Signed-off-by: Wang Kunpeng <[email protected]>
Signed-off-by: Che Ruan <[email protected]>

Labels

module:quantization, module:tests, ready, ready-for-test

4 participants