Skip to content

Commit 8411f8d

Browse files
Update docs/source/tutorials/Qwen3-235B.md
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: xuyexiong <[email protected]>
1 parent a230d29 commit 8411f8d

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/source/tutorials/Qwen3-235B.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -83,7 +83,7 @@ vllm serve vllm-ascend/Qwen3-235B-A22B-w8a8 \
8383
--gpu-memory-utilization 0.95 \
8484
--rope-scaling '{"rope_type":"yarn","factor":4,"original_max_position_embeddings":32768}' \
8585
--additional-config '{"ascend_scheduler_config":{"enabled":false}}' \
86-
--compilation-config '{"cudagraph_capture_sizes":[1,4,8],"cudagraph_mode":"FULL_DECODE_ONLY"}' \
86+
--compilation-config '{"cudagraph_capture_sizes":[1,4],"cudagraph_mode":"FULL_DECODE_ONLY"}' \
8787
--async-scheduling
8888
```
8989

0 commit comments

Comments
 (0)