[doc] Add Qwen2.5 tutorials #4636
base: main
Conversation
Model module is useless now. Let's remove it totally. - vLLM version: v0.11.2 Signed-off-by: yangshihao6 <[email protected]>
Code Review
This pull request adds a new tutorial for deploying the Qwen2.5-7B-Instruct model. The tutorial is comprehensive but appears to be an internal document that has been published without being adapted for a public audience. It contains several internal links, references to internal Docker images, and network-specific instructions that make it impossible for external users to follow. My review focuses on highlighting these issues to ensure the documentation is accessible and reproducible for the open-source community.
docs/source/tutorials/Qwen2.5-7b.md
Outdated
```
-v /usr/local/Ascend/driver:/usr/local/Ascend/driver \
-v /home/:/home \
-w /home/<name> \
mindie:dev-2.2.RC1.B070-800I-A2-py312-ubuntu22.03-x86_64 \
```
The Docker image mindie:dev-2.2.RC1.B070-800I-A2-py312-ubuntu22.03-x86_64 appears to be an internal image, which is not available to the public. For this tutorial to be reproducible by the open-source community, please use a publicly available base image and provide instructions to build upon it if necessary.
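For illustration only, a sketch of how the same command could be made reproducible, assuming the project's public `quay.io/ascend/vllm-ascend` image is an acceptable substitute; `<tag>` and `<name>` are placeholders, not values taken from the tutorial:

```bash
# Sketch: same mounts as the tutorial excerpt, but with a publicly pullable image.
# <tag> and <name> are placeholders to be filled in by the reader.
docker run -it \
    -v /usr/local/Ascend/driver:/usr/local/Ascend/driver \
    -v /home/:/home \
    -w /home/<name> \
    quay.io/ascend/vllm-ascend:<tag> \
    bash
```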
docs/source/tutorials/Qwen2.5.md
Outdated
```
:::{note}
The server may be in a restricted network zone (Yellow Zone) and require a Green Zone proxy tool for internet access. Download the proxy tool from the internal repository, run `PortMapping.exe` to obtain the proxy IP, and update `ip_addr` in `portproxy_remote.sh` before executing the script.
:::
```
The note about "Yellow Zone", "Green Zone proxy tool", and internal repositories is specific to a corporate network environment. This information is not relevant or helpful for external users and makes the documentation confusing. Please remove these internal-specific instructions to make the guide generally applicable.
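If proxy setup is worth mentioning at all, a generic alternative would be to document standard proxy environment variables instead of internal tooling; the host and port below are placeholders, not values from the tutorial:

```bash
# Placeholder values: set these only if your environment requires an HTTP proxy.
export http_proxy=http://<proxy-host>:<port>
export https_proxy=http://<proxy-host>:<port>
```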
```
Qwen2.5-7B-Instruct supports single-node single-card deployment on the 910B4 platform. Follow these steps to start the inference service:

1. Prepare model weights: Ensure the downloaded model weights are stored in the `./Qwen2.5-7B-Instruct/` directory.
2. Download the gsm8k dataset (for evaluation): [gsm8k.zip](https://vision-file-storage/api/file/download/attachment-v2/WIKI202511118986704/32978033/20251111T144846Z_9658c67a0fb349f9be081ab9ab9fd2bc.zip?attachment_id=32978033)
```
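For context, the serving step itself is elided from this excerpt; a minimal sketch of a single-card launch with the local weights might look like the following, where every flag and value is an assumption rather than the tutorial's exact command:

```bash
# Assumed example: serve the local Qwen2.5-7B-Instruct weights on a single card.
vllm serve ./Qwen2.5-7B-Instruct/ \
    --served-model-name Qwen2.5-7B-Instruct \
    --tensor-parallel-size 1 \
    --max-model-len 4096
```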
docs/source/tutorials/Qwen2.5.md
Outdated
```
#### Evaluation Results
Results and logs are saved to `benchmark/outputs/default/`. A sample accuracy report is shown below:
![Accuracy Evaluation Result](https://wiki.huawei.com/vision-file-storage/api/file/download/upload-v2/WIKI202511118986704/32981182/image.png)
```
The image for "Accuracy Evaluation Result" is hosted on an internal wiki.huawei.com server and is inaccessible to the public. To make the documentation useful for everyone, please embed the image directly into the repository and use a relative path. This also applies to the "Performance Evaluation Result" image on line 307.
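As a hedged example, the screenshot could be committed under a directory such as `docs/source/assets/` (an illustrative path, not necessarily an existing directory in this repository) and referenced with a relative Markdown link like `![Accuracy Evaluation Result](../assets/qwen2.5_accuracy.png)`.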
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
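For instance, if the repository's lint checks are driven by pre-commit (an assumption; the Contributing guide is authoritative), they can be run locally with:

```bash
# Runs every configured hook against the whole working tree.
pre-commit run --all-files
```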
What this PR does / why we need it?
Does this PR introduce any user-facing change?
How was this patch tested?