From 34e1098db8e9c7fe2341addbc961258843db4de5 Mon Sep 17 00:00:00 2001 From: AiChiMomo <1092626063@qq.com> Date: Tue, 2 Dec 2025 21:42:20 +0800 Subject: [PATCH] doc fix: deepseekv3.1 Signed-off-by: AiChiMomo <1092626063@qq.com> --- docs/source/tutorials/DeepSeek-V3.1.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/tutorials/DeepSeek-V3.1.md b/docs/source/tutorials/DeepSeek-V3.1.md index b1c4f85df3d..5db808a1f02 100644 --- a/docs/source/tutorials/DeepSeek-V3.1.md +++ b/docs/source/tutorials/DeepSeek-V3.1.md @@ -254,7 +254,7 @@ vllm serve /weights/DeepSeek-V3.1_w8a8mix_mtp \ ### Prefill-Decode Disaggregation -There are two ways to deploy `Prefill-Decode Disaggregation`: [Llmdatadist](./multi_node_pd_disaggregation_llmdatadist.md) and [Mooncake](./multi_node_pd_disaggregation_mooncake.md). We recommend use Mooncake for deploy. +We recommend using Mooncake for deployment: [Mooncake](./multi_node_pd_disaggregation_mooncake.md). Take Atlas 800 A3 (64G × 16) for example, we recommend to deploy 2P1D (4 nodes) rather than 1P1D (2 nodes), because there is no enough NPU memory to serve high concurrency in 1P1D case. - `DeepSeek-V3.1_w8a8mix_mtp 2P1D Layerwise` require 4 Atlas 800 A3 (64G × 16).