-
-
Notifications
You must be signed in to change notification settings - Fork 11.3k
[CI/Build]Add eval config for Qwen3-235B-A22B-Instruct-2507-FP8 #27113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI/Build]Add eval config for Qwen3-235B-A22B-Instruct-2507-FP8 #27113
Conversation
ca0144d to
ed452ff
Compare
.buildkite/lm-eval-harness/configs/Qwen3-235B-A22B-Thinking-2507-FP8.yaml
Outdated
Show resolved
Hide resolved
8720e75 to
303c0c5
Compare
418d612 to
bb9a649
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
💡 Codex Review
Here are some automated review suggestions for this pull request.
ℹ️ About Codex in GitHub
Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".
1c7f37a to
30f2bc0
Compare
.buildkite/lm-eval-harness/configs/Qwen3-235B-A22B-Thinking-2507-FP8.yaml
Outdated
Show resolved
Hide resolved
|
LGTM, have we verified by triggering an example CI on buildkite? |
30f2bc0 to
99d95b6
Compare
Thanks! I did it once over the weekend. Let me schedule a new one https://buildkite.com/vllm/ci/builds/35613/steps/canvas LM Eval Small Models https://buildkite.com/vllm/ci/builds/35613/steps/canvas?sid=019a0362-abbd-407f-8d10-243f063b0792 LM Eval Large Models (H200) https://buildkite.com/vllm/ci/builds/35613/steps/canvas?jid=019a04d4-029e-4ed2-90e6-ccc726f215d1 |
.buildkite/lm-eval-harness/configs/Qwen3-235B-A22B-Thinking-2507-FP8.yaml
Outdated
Show resolved
Hide resolved
99d95b6 to
6003d66
Compare
| @@ -1 +1,2 @@ | |||
| Meta-Llama-4-Maverick-17B-128E-Instruct-FP8.yaml | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we also want to test llama4? Or we can just start with Qwen?
ef047d9 to
b09ee61
Compare
bfcf20c to
69ac35f
Compare
|
how long does it take with and without VLLM_USE_DEEP_GEMM? sounds like qwen model is not doing good with deepgemm? cc: @minosfuture @houseroad |
there seems some build issue at the moment (sample failure, error msg |
01794c3 to
33c059c
Compare
|
@yeqcharlotte - updated the test result in the Test Result section. The fastest I can get using H100 CI is about 24 minutes. As a comparison, the same setting spent about 15 minutes on my local H100. One obvious difference between the two H100 are - my local H100 is 96GB and CI H100 is 80GB. |
Signed-off-by: Huamin Li <[email protected]>
33c059c to
0222140
Compare
|
let's get it started! |
…-project#27113) Signed-off-by: Huamin Li <[email protected]>
…-project#27113) Signed-off-by: Huamin Li <[email protected]>
…-project#27113) Signed-off-by: Huamin Li <[email protected]>
…-project#27113) Signed-off-by: Huamin Li <[email protected]>
…-project#27113) Signed-off-by: Huamin Li <[email protected]> Signed-off-by: Eldar Kurtic <[email protected]>
Purpose
.buildkite/lm-eval-harness/configs/models-large-h100.txtto use.buildkite/lm-eval-harness/configs/models-large-hopper.txtper suggestion.label: LM Eval Large Models (H100) # optionalto runconfigs/models-large-hopper.txtTest Plan
Test Result
WIP
VLLM_USE_DEEP_GEMM=1
1 passed, 104 warnings in 3062.67s (0:51:02)https://buildkite.com/vllm/ci/builds/36443/steps/canvas?sid=019a272c-b17c-4404-96e9-0e83356d6bb8
VLLM_USE_DEEP_GEMM=1
1 passed, 104 warnings in 3041.13s (0:50:41)https://buildkite.com/vllm/ci/builds/36457/steps/canvas?sid=019a27b5-4ec8-4166-8bba-091034e14805
VLLM_USE_DEEP_GEMM=0
1 passed, 104 warnings in 1900.52s (0:31:40)https://buildkite.com/vllm/ci/builds/36470/steps/canvas?sid=019a2826-8d7a-4453-925b-472e24acf179
VLLM_USE_DEEP_GEMM=0 + kv_cache_dtype=fp8
1 passed, 104 warnings in 1445.30s (0:24:05)https://buildkite.com/vllm/ci/builds/36607/steps/canvas?sid=019a2b72-d030-4a51-aefa-ff0838ffc8fb
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.