Skip to content

Commit 0c04bf1

Browse files
[Fixbug] Fix accuracy for DeepSeek-V2-Lite (#3016)
### What this PR does / why we need it? Fix accuracy for DeepSeek-V2-Lite ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? CI passed - vLLM version: v0.10.2 - vLLM main: vllm-project/vllm@66072b3 Signed-off-by: hfadzxy <[email protected]>
1 parent 367edff commit 0c04bf1

File tree

2 files changed

+4
-2
lines changed

2 files changed

+4
-2
lines changed

tests/e2e/models/configs/DeepSeek-V2-Lite.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,8 @@ tasks:
77
- name: "exact_match,flexible-extract"
88
value: 0.375
99
tensor_parallel_size: 2
10+
batch_size: 8
11+
gpu_memory_utilization: 0.7
1012
apply_chat_template: False
1113
fewshot_as_multiturn: False
1214
trust_remote_code: True

tests/e2e/models/test_lm_eval_correctness.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -84,7 +84,7 @@ def generate_report(tp_size, eval_config, report_data, report_dir, env_config):
8484
apply_chat_template=eval_config.get("apply_chat_template", True),
8585
fewshot_as_multiturn=eval_config.get("fewshot_as_multiturn", True),
8686
limit=eval_config.get("limit", "N/A"),
87-
batch_size="auto",
87+
batch_size=eval_config.get("batch_size", "auto"),
8888
num_fewshot=eval_config.get("num_fewshot", "N/A"),
8989
rows=report_data["rows"],
9090
parallel_mode=parallel_mode)
@@ -110,7 +110,7 @@ def test_lm_eval_correctness_param(config_filename, tp_size, report_dir,
110110
"apply_chat_template": eval_config.get("apply_chat_template", True),
111111
"fewshot_as_multiturn": eval_config.get("fewshot_as_multiturn", True),
112112
"limit": eval_config.get("limit", None),
113-
"batch_size": "auto",
113+
"batch_size": eval_config.get("batch_size", "auto"),
114114
}
115115
for s in ["num_fewshot", "fewshot_as_multiturn", "apply_chat_template"]:
116116
val = eval_config.get(s, None)

0 commit comments

Comments
 (0)