bugfix: support disagg PD for MTP. #551

RobbieLeung · 2025-12-16T07:04:18Z

No description provided.

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp

zhang-minchao · 2026-01-04T09:29:10Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors the KV cache transfer mechanism, seemingly to support disaggregated pipeline parallelism for MTP. The changes primarily involve removing the logic for transferring embeddings and modifying the KV cache allocation and deallocation to use direct aclrt calls instead of the llm_data_dist abstraction. The refactoring of push_kv_blocks to consolidate duplicated logic is a good improvement for maintainability.

My review includes suggestions to remove dead code, fix a potential type-safety issue with pointer casting, and improve code style and memory management. Overall, the changes are logical and well-structured.

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp

xllm/core/runtime/speculative_worker_impl.cpp

RobbieLeung · 2026-01-04T12:01:52Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors the KV cache transfer mechanism, particularly for speculative decoding on NPU, to support disaggregated pipeline parallelism. The key changes include:

Modifying the KV cache allocation to use aclrtMalloc for direct device memory allocation followed by registration with llm_data_dist, instead of relying on llm_data_dist for allocation.
Removing the logic for handling a separate embedding cache transfer, which simplifies the SpecKVCacheTransfer class by removing several functions and member variables.
Refactoring push_kv_blocks and push_kv_blocks_spec to eliminate code duplication by introducing a common push_kv_blocks_internal function.

The changes are well-structured and improve code clarity and maintainability. I have a couple of suggestions to improve robustness and clarity: one regarding the use of a logging macro with a misleading error message, and another to prevent a potential null pointer dereference.

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp

RobbieLeung requested review from JimHsiung, liutongxuan and yq33victor December 16, 2025 07:04

DragonFive self-requested a review December 17, 2025 10:48

JimHsiung previously approved these changes Dec 18, 2025

View reviewed changes

XuZhang99 changed the title ~~bugfix: support PD for MTP.~~ bugfix: support disagg PD for MTP. Dec 18, 2025

DragonFive reviewed Dec 18, 2025

View reviewed changes

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp Outdated Show resolved Hide resolved

RobbieLeung dismissed JimHsiung’s stale review via 3847fc1 December 29, 2025 06:07

RobbieLeung force-pushed the feat/mtp_pd branch from 1fad4fe to 3847fc1 Compare December 29, 2025 06:07

RobbieLeung requested review from DongheJin, XuZhang99 and walsonyang as code owners December 29, 2025 06:07

XuZhang99 previously approved these changes Dec 29, 2025

View reviewed changes

DongheJin previously approved these changes Jan 4, 2026

View reviewed changes

gemini-code-assist bot reviewed Jan 4, 2026

View reviewed changes

RobbieLeung dismissed stale reviews from DongheJin and XuZhang99 via fb4069c January 4, 2026 11:58

RobbieLeung force-pushed the feat/mtp_pd branch from 3847fc1 to fb4069c Compare January 4, 2026 11:58

gemini-code-assist bot reviewed Jan 4, 2026

View reviewed changes

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp Show resolved Hide resolved

xllm/core/framework/kv_cache/spec_kv_cache_transfer.cpp Outdated Show resolved Hide resolved

bugfix: support disagg PD for MTP.

881bf6c

RobbieLeung force-pushed the feat/mtp_pd branch from fb4069c to 881bf6c Compare January 4, 2026 12:16

DongheJin approved these changes Jan 5, 2026

View reviewed changes

XuZhang99 approved these changes Jan 5, 2026

View reviewed changes

RobbieLeung merged commit 6493d8e into jd-opensource:main Jan 5, 2026
17 of 19 checks passed

RobbieLeung deleted the feat/mtp_pd branch January 5, 2026 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bugfix: support disagg PD for MTP. #551

bugfix: support disagg PD for MTP. #551

Uh oh!

RobbieLeung commented Dec 16, 2025

Uh oh!

Uh oh!

zhang-minchao commented Jan 4, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RobbieLeung commented Jan 4, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

bugfix: support disagg PD for MTP. #551

bugfix: support disagg PD for MTP. #551

Uh oh!

Conversation

RobbieLeung commented Dec 16, 2025

Uh oh!

Uh oh!

zhang-minchao commented Jan 4, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RobbieLeung commented Jan 4, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants