opti profiler default param #4609
Conversation
Code Review
This pull request updates the default aic_metrics for the torch NPU profiler to PipeUtilization from AiCoreNone. While enabling more detailed metrics by default is useful for profiling, hardcoding this value could cause issues on hardware where it's not supported or for users who wish to minimize profiling overhead. I have added a comment suggesting to make this parameter configurable via an environment variable to increase flexibility and robustness.
vllm_ascend/worker/worker_v1.py
Outdated
```diff
  profiler_level=torch_npu.profiler.ProfilerLevel.Level1,
  msprof_tx=False,
- aic_metrics=torch_npu.profiler.AiCMetrics.AiCoreNone,
+ aic_metrics=torch_npu.profiler.AiCMetrics.PipeUtilization,
```
Hardcoding aic_metrics to PipeUtilization could be problematic. This metric might not be supported on all hardware versions, or some users might prefer to disable it to minimize profiling overhead. It would be more robust to make this configurable via an environment variable (e.g., VLLM_ASCEND_PROFILER_AIC_METRICS defined in vllm_ascend/envs.py). This would allow users to easily switch between different metrics like PipeUtilization and AiCoreNone based on their needs.
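A minimal sketch of what the suggested override might look like. The variable name `VLLM_ASCEND_PROFILER_AIC_METRICS` comes from the review comment above; the helper itself is hypothetical, and the allowed-values set lists only the metric names visible in this thread (the real `AiCMetrics` enum has more members). The `torch_npu` lookup is deferred to a comment so the sketch stays importable without Ascend hardware:

```python
import os

# Hypothetical helper based on the review suggestion; only metric names
# that appear in this PR thread are listed here -- the real enum is larger.
_VALID_AIC_METRICS = {
    "AiCoreNone", "PipeUtilization", "MemoryUB", "L2Cache", "MemoryAccess",
}

def resolve_aic_metrics(default: str = "PipeUtilization") -> str:
    """Return the AiCMetrics attribute name to use, honoring the env override."""
    value = os.environ.get("VLLM_ASCEND_PROFILER_AIC_METRICS", default)
    if value not in _VALID_AIC_METRICS:
        raise ValueError(f"unsupported aic_metrics value: {value!r}")
    return value

# The worker could then resolve the enum member lazily, e.g.:
#   aic_metrics = getattr(torch_npu.profiler.AiCMetrics, resolve_aic_metrics())
```

This keeps `PipeUtilization` as the new default while letting users who hit unsupported hardware or want minimal overhead switch back to `AiCoreNone` without a code change.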
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
```python
# 6: torch_npu.profiler.AiCMetrics.MemoryUB;
# 7: torch_npu.profiler.AiCMetrics.L2Cache;
# 8: torch_npu.profiler.AiCMetrics.MemoryAccess;
# If not set, it will be torch_npu.profiler.AiCMetrics.PipeUtilization by default.
```
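The comment above maps integer codes to `AiCMetrics` members. Based only on the lines visible in this excerpt (codes 0–5 exist in the full comment but are not shown here), the lookup could be sketched as follows; the helper name and the table are reconstructions, not the PR's actual code:

```python
from typing import Optional

# Partial code-to-metric table reconstructed from the visible comment lines
# only; entries 0-5 exist in the full comment but are not shown in this excerpt.
_AIC_METRICS_BY_CODE = {
    6: "MemoryUB",
    7: "L2Cache",
    8: "MemoryAccess",
}

def aic_metrics_name(code: Optional[int]) -> str:
    # Per the comment: "If not set, it will be ... PipeUtilization by default."
    if code is None:
        return "PipeUtilization"
    return _AIC_METRICS_BY_CODE[code]  # unknown codes raise KeyError
```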
Please provide example profiling runs with AiCMetrics.PipeUtilization enabled and disabled to show the performance degradation introduced by this metric.
Signed-off-by: zzzzwwjj <[email protected]>
What this PR does / why we need it?
Set the profiler's param `aic_metrics` to `torch_npu.profiler.AiCMetrics.PipeUtilization` by default. This parameter can obtain more information for ops.
Does this PR introduce any user-facing change?
How was this patch tested?