[BugFix] Fix deepseek v3.2 mtp bug. #3900

whx-sjtu · 2025-10-30T09:36:08Z

What this PR does / why we need it?

This PR fixes a DeepSeek V3.2 MTP bug.

Does this PR introduce any user-facing change?

None

How was this patch tested?

All existing CI tests should pass.

vLLM version: v0.11.0
vLLM main: vllm-project/vllm@83f478b

github-actions · 2025-10-30T09:37:09Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

gemini-code-assist

Code Review

This PR addresses a bug in the deepseek v3.2 mtp implementation by incorporating the DeepseekV32IndexerCache class and adjusting the logic for identifying attention layers in the draft model. The changes ensure that the indexer layers are correctly excluded from the draft attention layers, preventing potential errors during speculative decoding.
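The exclusion described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code from this PR: the classes `AttentionLayer` and `IndexerCache`, the helper names, and the layer names are all hypothetical stand-ins, and the sketch assumes that indexer cache layers are registered alongside ordinary attention layers, so they show up when collecting layers by attention type.

```python
class AttentionLayer:
    """Hypothetical stand-in for a generic attention layer base class."""


class IndexerCache(AttentionLayer):
    """Hypothetical stand-in for an indexer cache layer.

    Assumption: it is registered like an attention layer, so a lookup by
    attention type also returns it.
    """


def layers_of_type(layers: dict, layer_type: type) -> set:
    """Return the names of all registered layers that are instances of layer_type."""
    return {name for name, layer in layers.items() if isinstance(layer, layer_type)}


def find_draft_attention_layers(layers_before_draft: dict, layers_after_draft: dict) -> set:
    """Names of attention layers that belong to the draft model only.

    Layers present before the draft model was loaded belong to the target
    model. Indexer cache layers are subtracted so that only real attention
    layers of the draft model remain.
    """
    target_attn = layers_of_type(layers_before_draft, AttentionLayer)
    all_attn = layers_of_type(layers_after_draft, AttentionLayer)
    indexer = layers_of_type(layers_after_draft, IndexerCache)
    return all_attn - target_attn - indexer


# Made-up layer names, for illustration only.
target_layers = {
    "target.layers.0.attn": AttentionLayer(),
    "target.layers.0.indexer_cache": IndexerCache(),
}
all_layers = dict(target_layers)
all_layers["draft.layers.0.attn"] = AttentionLayer()
all_layers["draft.layers.0.indexer_cache"] = IndexerCache()

draft_attn = find_draft_attention_layers(target_layers, all_layers)
print(draft_attn)
```

Without the final subtraction, the draft model's indexer cache would be counted as a draft attention layer, which is the kind of misidentification the review summary describes.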

vllm_ascend/spec_decode/mtp_proposer.py

Signed-off-by: whx-sjtu <2952154980@qq.com>

gemini-code-assist bot reviewed Oct 30, 2025

Two review comments on vllm_ascend/spec_decode/mtp_proposer.py (both outdated and resolved).

fix deepseek 3.2 mtp bug

d317577

Signed-off-by: whx-sjtu <2952154980@qq.com>

whx-sjtu force-pushed the fix_32_mtp branch from b2de33f to d317577 Compare October 31, 2025 01:44

whx-sjtu added the labels "ready" (read for review) and "ready-for-test" (start test by label for PR) on Oct 31, 2025

add notes

926305f

Signed-off-by: whx-sjtu <2952154980@qq.com>

wangxiyuan approved these changes Oct 31, 2025


wangxiyuan approved these changes Nov 4, 2025


wangxiyuan merged commit e9bb449 into vllm-project:main Nov 4, 2025
24 checks passed
