[perf]Allow setting cudagraph_capture_sizes in VllmRunner #4694

MrZ20 · 2025-12-04T03:28:39Z

What this PR does / why we need it?

Allow setting cudagraph_capture_sizes in VllmRunner

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.12.0
vLLM main: https://github.com/vllm-project/vllm/commit/v0.12.0

Signed-off-by: MrZ20 <[email protected]>

gemini-code-assist

Code Review

This pull request introduces the cudagraph_capture_sizes parameter to VllmRunner initializations across numerous test files, setting it to [4]. This is a good step towards standardizing test configurations for CUDA graph capturing.

However, I've identified several instances where cudagraph_capture_sizes is defined both as a direct argument to VllmRunner and within the compilation_config dictionary. In these cases, the value inside compilation_config takes precedence, rendering the newly added direct argument ineffective. This could lead to confusion and incorrect test behavior. I've left specific comments with suggestions to remove the redundant definitions to ensure clarity and correctness.
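The precedence issue described above can be illustrated with a minimal sketch. The function `resolve_capture_sizes` is hypothetical and is not part of vLLM or VllmRunner. It only models the behavior the review describes, assuming a value inside `compilation_config` overrides the direct keyword argument.

```python
from typing import Any, Optional


def resolve_capture_sizes(
    cudagraph_capture_sizes: Optional[list[int]] = None,
    compilation_config: Optional[dict[str, Any]] = None,
) -> Optional[list[int]]:
    """Hypothetical model of the precedence described in the review."""
    config = compilation_config or {}
    # Assumption: a value inside compilation_config wins over the direct argument.
    if "cudagraph_capture_sizes" in config:
        return config["cudagraph_capture_sizes"]
    return cudagraph_capture_sizes


# The direct argument [4] has no effect when the config defines its own sizes.
# The sizes [1, 2, 8] are illustrative values, not taken from the PR.
effective = resolve_capture_sizes(
    cudagraph_capture_sizes=[4],
    compilation_config={"cudagraph_capture_sizes": [1, 2, 8]},
)
print(effective)
```

Under this assumption, the direct argument only matters when `compilation_config` is absent or does not define `cudagraph_capture_sizes`.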

gemini-code-assist · 2025-12-04T03:30:36Z

tests/e2e/multicard/test_full_graph_mode.py

    with VllmRunner(model,
                    max_model_len=1024,
                    tensor_parallel_size=2,
+                    cudagraph_capture_sizes=[4],


The added cudagraph_capture_sizes=[4] argument is ineffective because compilation_config on line 46 also defines cudagraph_capture_sizes. The value from compilation_config takes precedence, which can lead to confusion. Please remove this redundant argument and rely on the one in compilation_config.

gemini-code-assist · 2025-12-04T03:30:36Z

tests/e2e/multicard/test_full_graph_mode.py

    with VllmRunner(model,
                    max_model_len=1024,
                    tensor_parallel_size=2,
+                    cudagraph_capture_sizes=[4],


The added cudagraph_capture_sizes=[4] argument is ineffective because compilation_config on line 94 also defines cudagraph_capture_sizes. The value from compilation_config takes precedence, which can lead to confusion. Please remove this redundant argument and rely on the one in compilation_config.

gemini-code-assist · 2025-12-04T03:30:36Z

tests/e2e/multicard/test_shared_expert_dp.py

            model,
            max_model_len=1024,
            tensor_parallel_size=2,
+            cudagraph_capture_sizes=[4],


The added cudagraph_capture_sizes=[4] argument is ineffective because compilation_config on line 58 also defines cudagraph_capture_sizes. The value from compilation_config takes precedence, which can lead to confusion. Please remove this redundant argument and rely on the one in compilation_config.

gemini-code-assist · 2025-12-04T03:30:36Z

tests/e2e/singlecard/test_aclgraph.py

        with VllmRunner(
                model,
                max_model_len=1024,
+                cudagraph_capture_sizes=[4],


The added cudagraph_capture_sizes=[4] argument is ineffective because compilation_config on line 176 also defines cudagraph_capture_sizes. The value from compilation_config takes precedence, which can lead to confusion. Please remove this redundant argument and rely on the one in compilation_config.
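One way to catch the redundancy flagged in these comments is a small guard over the keyword arguments before they reach the runner. This is a sketch only: `check_no_duplicate_capture_sizes` is a hypothetical helper, not an existing vLLM or vLLM Ascend function, and it assumes `compilation_config` is passed as a plain dictionary.

```python
from typing import Any


def check_no_duplicate_capture_sizes(runner_kwargs: dict[str, Any]) -> None:
    """Raise if cudagraph_capture_sizes is set both directly and in compilation_config.

    Hypothetical helper for illustration; not part of any real library.
    """
    config = runner_kwargs.get("compilation_config") or {}
    set_directly = "cudagraph_capture_sizes" in runner_kwargs
    set_in_config = "cudagraph_capture_sizes" in config
    if set_directly and set_in_config:
        raise ValueError(
            "cudagraph_capture_sizes is defined both as a direct argument and "
            "inside compilation_config; keep only one definition."
        )


# A single definition passes the check without raising.
check_no_duplicate_capture_sizes(
    {"max_model_len": 1024, "cudagraph_capture_sizes": [4]}
)
```

Failing loudly here is a design choice: it surfaces the conflict at call time instead of letting one value silently override the other.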

github-actions · 2025-12-04T04:13:01Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.

add cudagraph_capture_sizes

c6ca7ea

Signed-off-by: MrZ20 <[email protected]>

gemini-code-assist bot reviewed Dec 4, 2025

github-actions bot added the module:tests label Dec 4, 2025
