Commit c95c271

[E2E] Optimize nightly testcase. (#4886)
### What this PR does / why we need it?

Optimize nightly testcase.

Changes:
- tests/e2e/nightly/multi_node/config/models/Qwen3-235B-A3B.yaml: Add accuracy and performance benchmark
- tests/e2e/models/configs/Qwen3-8B-Base.yaml: Delete
- tests/e2e/models/configs/internlm-7b.yaml: Change to internlm3-8b-instruct
- tests/e2e/nightly/models/test_deepseek_r1_w8a8_eplb.py: Change to DeepSeek-R1-0528-W8A8 model

- vLLM version: v0.12.0
- vLLM main: vllm-project/vllm@ad32e3e

Signed-off-by: menogrey <[email protected]>
1 parent 66b0781 commit c95c271

8 files changed: +17 -20 lines changed

.github/workflows/vllm_ascend_test_nightly_a3.yaml

Lines changed: 1 addition & 1 deletion
@@ -53,7 +53,7 @@ jobs:
   config_file_path: DeepSeek-V3.yaml
   size: 2
 - name: multi-node-qwen3-dp
-  config_file_path: Qwen3-235B-A3B.yaml
+  config_file_path: Qwen3-235B-A22B.yaml
   size: 2
 - name: multi-node-dpsk-4node-pd
   config_file_path: DeepSeek-R1-W8A8.yaml

docs/source/developer_guide/contribution/multi_node_test.md

Lines changed: 1 addition & 1 deletion
@@ -142,7 +142,7 @@ currently, the multi-node test workflow defined in the [vllm_ascend_test_nightly
   config_file_path: tests/e2e/nightly/multi_node/config/models/DeepSeek-V3.yaml
   size: 2
 - name: multi-node-qwen3-dp
-  config_file_path: tests/e2e/nightly/multi_node/config/models/Qwen3-235B-A3B.yaml
+  config_file_path: tests/e2e/nightly/multi_node/config/models/Qwen3-235B-A22B.yaml
   size: 2
 - name: multi-node-dpsk-4node-pd
   config_file_path: tests/e2e/nightly/multi_node/config/models/DeepSeek-R1-W8A8.yaml

tests/e2e/models/configs/Qwen3-8B-Base.yaml

Lines changed: 0 additions & 14 deletions
This file was deleted.

tests/e2e/models/configs/accuracy.txt

Lines changed: 1 addition & 1 deletion
@@ -12,6 +12,6 @@ InternVL3-8B.yaml
 InternVL3_5-8B.yaml
 ERNIE-4.5-21B-A3B-PT.yaml
 gemma-3-4b-it.yaml
-internlm-7b.yaml
+internlm3-8b-instruct.yaml
 Molmo-7B-D-0924.yaml
 llava-1.5-7b-hf.yaml
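
Because accuracy.txt lists config file names one per line, a rename such as internlm-7b.yaml to internlm3-8b-instruct.yaml has to be mirrored there. A minimal sketch of a consistency check follows, assuming each listed name resolves relative to a single config directory. The helper name `missing_configs` and that lookup rule are hypothetical and are not taken from the repository.

```python
from pathlib import Path


def missing_configs(list_file: str, config_dir: str) -> list[str]:
    """Return names listed in list_file that have no matching file in config_dir.

    Assumption: list_file holds one config file name per line and blank
    lines are ignored. This is an illustrative helper, not project code.
    """
    names = [
        line.strip()
        for line in Path(list_file).read_text(encoding="utf-8").splitlines()
        if line.strip()
    ]
    return [name for name in names if not (Path(config_dir) / name).is_file()]
```

Called as `missing_configs("tests/e2e/models/configs/accuracy.txt", "tests/e2e/models/configs")`, an empty list would indicate that every listed config is present.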

tests/e2e/models/configs/internlm-7b.yaml renamed to tests/e2e/models/configs/internlm3-8b-instruct.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-model_name: "Shanghai_AI_Laboratory/internlm-7b"
+model_name: "Shanghai_AI_Laboratory/internlm3-8b-instruct"
 hardware: "Atlas A2 Series"
 tasks:
 - name: "ceval-valid"

tests/e2e/models/conftest.py

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ def pytest_addoption(parser):
     parser.addoption(
         "--config",
         action="store",
-        default="./tests/e2e/models/configs/Qwen3-8B-Base.yaml",
+        default="./tests/e2e/models/configs/Qwen3-8B.yaml",
         help="Path to the model config YAML file",
     )
     parser.addoption(
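
The default changes because Qwen3-8B-Base.yaml is deleted in this commit, so a run without `--config` would otherwise point at a missing file. The sketch below shows how such an option is typically registered and read back in a conftest.py. Only the option name, default, and help text come from the diff; the constant `DEFAULT_CONFIG` and the helper `config_path` are hypothetical names added for illustration.

```python
# Hypothetical constant; the path itself is the new default from the diff.
DEFAULT_CONFIG = "./tests/e2e/models/configs/Qwen3-8B.yaml"


def pytest_addoption(parser):
    """Register --config so a test run can select a model config file."""
    parser.addoption(
        "--config",
        action="store",
        default=DEFAULT_CONFIG,
        help="Path to the model config YAML file",
    )


def config_path(config):
    """Return the --config value from a pytest Config-like object.

    Hypothetical helper: in a real suite this lookup would more likely
    live inside a fixture that receives pytest's `request` object.
    """
    return config.getoption("--config")
```

With this in place, `pytest tests/e2e/models --config path/to/other.yaml` would override the default, and omitting the flag would fall back to Qwen3-8B.yaml.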

tests/e2e/nightly/models/test_deepseek_r1_w8a8_eplb.py renamed to tests/e2e/nightly/models/test_deepseek_r1_0528_w8a8_eplb.py

Lines changed: 1 addition & 1 deletion
@@ -25,7 +25,7 @@
 from tools.aisbench import run_aisbench_cases
 
 MODELS = [
-    "vllm-ascend/DeepSeek-R1-W8A8",
+    "vllm-ascend/DeepSeek-R1-0528-W8A8",
 ]
 
 prompts = [

tests/e2e/nightly/multi_node/config/models/Qwen3-235B-A3B.yaml renamed to tests/e2e/nightly/multi_node/config/models/Qwen3-235B-A22B.yaml

Lines changed: 11 additions & 0 deletions
@@ -48,6 +48,17 @@ deployment:
       --no-enable-prefix-caching
       --gpu-memory-utilization 0.9
 benchmarks:
+  perf:
+    case_type: performance
+    dataset_path: vllm-ascend/GSM8K-in3500-bs2800
+    request_conf: vllm_api_stream_chat
+    dataset_conf: gsm8k/gsm8k_gen_0_shot_cot_str_perf
+    num_prompts: 2800
+    max_out_len: 1500
+    batch_size: 700
+    request_rate: 11.2
+    baseline: 1
+    threshold: 0.97
   acc:
     case_type: accuracy
     dataset_path: vllm-ascend/gsm8k
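
The diff does not show how `baseline` and `threshold` are evaluated, so the following is only one plausible reading: a run passes when the measured value reaches at least `threshold` times `baseline`. The sketch also assumes `request_rate` is in requests per second. Both function names are hypothetical and do not come from the benchmark tooling.

```python
def passes_threshold(measured: float, baseline: float, threshold: float) -> bool:
    """Assumed rule: pass when measured is at least threshold * baseline."""
    return measured >= baseline * threshold


def submission_seconds(num_prompts: int, request_rate: float) -> float:
    """Time to issue all prompts, assuming request_rate is requests per second."""
    return num_prompts / request_rate


# Values from the perf block in the diff.
PERF = {"num_prompts": 2800, "request_rate": 11.2, "baseline": 1, "threshold": 0.97}

if __name__ == "__main__":
    # Under the assumed rule, a measured ratio of 0.98 passes and 0.96 fails.
    print(passes_threshold(0.98, PERF["baseline"], PERF["threshold"]))
    print(passes_threshold(0.96, PERF["baseline"], PERF["threshold"]))
    # 2800 prompts at 11.2 requests per second take about 250 seconds to submit.
    print(round(submission_seconds(PERF["num_prompts"], PERF["request_rate"])))
```

Under these assumptions, `baseline: 1` with `threshold: 0.97` would tolerate a regression of up to 3 percent relative to the baseline.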
