Adapt the mooncake layerwise connector to PCP/DCP #4924

ksiyuan · 2025-12-11T11:18:23Z

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

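This PR aims to adapt the mooncake layerwise connector to PCP/DCP. The changes are extensive and introduce complex logic for handling distributed KV cache transfer. I found several critical issues involving incorrect port calculation, race conditions, and incorrect data slicing, which could lead to communication failures or incorrect behavior. These issues should be addressed to ensure the implementation is correct.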

vllm_ascend/distributed/mooncake_layerwise_connector.py

gemini-code-assist · 2025-12-11T11:20:53Z

vllm_ascend/distributed/mooncake_layerwise_connector.py

+        remote_block_offset = 0
+        for local_kv_id in range(len(remote_handshake_port_list)):
+            num_blocks_to_push = local_block_nums[local_kv_id]
+            local_block_ids_list.append(
+                meta.local_block_ids[:num_blocks_to_push])
+            remote_block_ids_list.append(
+                meta.remote_block_ids[remote_block_offset:remote_block_offset+
+                num_blocks_to_push])
+            remote_block_offset += num_blocks_to_push


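The logic for slicing meta.local_block_ids is incorrect. On every loop iteration it slices from the start of the list again (meta.local_block_ids[:num_blocks_to_push]). As a result, every split starts from the same initial local blocks, which leads to incorrect data transfer. You should use an offset to slice the correct portion of meta.local_block_ids on each iteration, similar to how remote_block_offset is used for meta.remote_block_ids.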

        remote_block_offset = 0
        local_block_offset = 0
        for local_kv_id in range(len(remote_handshake_port_list)):
            num_blocks_to_push = local_block_nums[local_kv_id]
            local_block_ids_list.append(
                meta.local_block_ids[local_block_offset:local_block_offset +
                num_blocks_to_push])
            remote_block_ids_list.append(
                meta.remote_block_ids[remote_block_offset:remote_block_offset +
                num_blocks_to_push])
            remote_block_offset += num_blocks_to_push
            local_block_offset += num_blocks_to_push
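The difference between prefix slicing and offset slicing is easy to check on toy data. The following is a minimal standalone sketch, not code from the PR: the function names `split_with_offsets` and `split_from_start` are hypothetical, and plain lists of integers stand in for `meta.local_block_ids`, `meta.remote_block_ids`, and `local_block_nums`.

```python
from itertools import accumulate


def split_with_offsets(local_block_ids, remote_block_ids, local_block_nums):
    """Split both ID lists into consecutive chunks sized by local_block_nums.

    Hypothetical helper for illustration; mirrors the offset-based loop
    suggested in the review comment.
    """
    # Start offset of each chunk: 0, n0, n0 + n1, ...
    starts = [0, *accumulate(local_block_nums)][:-1]
    local_chunks = [
        local_block_ids[s:s + n] for s, n in zip(starts, local_block_nums)
    ]
    remote_chunks = [
        remote_block_ids[s:s + n] for s, n in zip(starts, local_block_nums)
    ]
    return local_chunks, remote_chunks


def split_from_start(local_block_ids, local_block_nums):
    """Reproduce the flagged behavior: every chunk is a prefix of the list."""
    return [local_block_ids[:n] for n in local_block_nums]


local_ids = [10, 11, 12, 13, 14]
remote_ids = [20, 21, 22, 23, 24]
block_nums = [2, 3]

local_chunks, remote_chunks = split_with_offsets(local_ids, remote_ids, block_nums)
print(local_chunks)   # [[10, 11], [12, 13, 14]]
print(remote_chunks)  # [[20, 21], [22, 23, 24]]
print(split_from_start(local_ids, block_nums))  # [[10, 11], [10, 11, 12]]
```

With prefix slicing, local blocks 13 and 14 are never paired with a remote block, while blocks 10 and 11 are sent twice. With a shared running offset, each local chunk lines up one-to-one with its remote chunk.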


github-actions · 2025-12-11T11:51:35Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

gemini-code-assist bot reviewed Dec 11, 2025


ksiyuan force-pushed the main branch 5 times, most recently from b77ab28 to ee4baa3 · December 13, 2025 08:14

Adapt the mooncake layerwise connector to PCP/DCP

1324a5b

ksiyuan force-pushed the main branch from ee4baa3 to 1324a5b · December 13, 2025 08:40
