[deepseek] kernel block size for UniformTypeKVCacheSpecs #26559

heheda12345 · 2025-10-10T03:40:28Z

Purpose

After #24486, DeepSeek V3.2 throws this error:

 NotImplementedError: unknown kv cache spec UniformTypeKVCacheSpecs

This PR fixes it.

FIX #26524

Test Plan

python3 examples/offline_inference/basic/generate.py --model deepseek-ai/DeepSeek-V3.2-Exp --gpu_memory_utilization 0.8 -tp 8

Test Result

--------------------------------------------------
Prompt: 'Hello, my name is'
Generated text: ' Christian Munoz and\nthis is my final project for my summer\n2020'
--------------------------------------------------
Prompt: 'The president of the United States is'
Generated text: ' the head of state and head of government of the United States, indirectly elected to'
--------------------------------------------------
Prompt: 'The capital of France is'
Generated text: ' Paris, and the capital of Spain is Madrid.\n\n**Question:** The capital of'
--------------------------------------------------
Prompt: 'The future of AI is'
Generated text: ' the future of work\n\nThe future of AI is the future of work\n\nThe'
--------------------------------------------------

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

gemini-code-assist

Code Review

This pull request correctly adds support for UniformTypeKVCacheSpecs to address an issue with deepseek models. The approach of unpacking the spec and then handling it in the existing logic is sound. I've identified a minor improvement opportunity in the error reporting within gpu_model_runner.py to enhance debuggability for future unhandled spec types.
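The "unpack, then reuse the existing logic" approach described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `AttentionSpec`, `MambaSpec`, `UniformTypeSpecs`, `unwrap`, and `kernel_block_size` are hypothetical stand-ins, and the block-size selection rule is an assumption made only so the example runs end to end.

```python
from dataclasses import dataclass


@dataclass
class AttentionSpec:
    """Hypothetical stand-in for a per-layer attention KV cache spec."""
    block_size: int


@dataclass
class MambaSpec:
    """Hypothetical stand-in for a per-layer state-space KV cache spec."""
    block_size: int


@dataclass
class UniformTypeSpecs:
    """Hypothetical wrapper holding per-layer specs that all share one type."""
    kv_cache_specs: dict


def unwrap(spec):
    # Every layer inside the wrapper has the same spec type, so any one
    # of them is enough to decide how to dispatch.
    if isinstance(spec, UniformTypeSpecs):
        return next(iter(spec.kv_cache_specs.values()))
    return spec


def kernel_block_size(spec, supported_sizes=(64, 32, 16)):
    """Pick a kernel block size for a spec (selection rule is assumed)."""
    spec = unwrap(spec)
    if isinstance(spec, AttentionSpec):
        # Assumed rule: largest supported size that divides the block size.
        for size in supported_sizes:
            if spec.block_size % size == 0:
                return size
        raise ValueError(f"no supported kernel size for {spec.block_size}")
    if isinstance(spec, MambaSpec):
        return spec.block_size
    # Name the offending type so an unhandled spec is easy to diagnose.
    raise NotImplementedError(f"unknown kv cache spec {type(spec).__name__}")


wrapped = UniformTypeSpecs(
    {"layers.0": AttentionSpec(block_size=64),
     "layers.1": AttentionSpec(block_size=64)}
)
print(kernel_block_size(wrapped))
```

Unwrapping up front keeps the type dispatch unchanged for plain specs, and including the type name in the `NotImplementedError` message is one way to get the more debuggable error reporting the review mentions.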

LucasWilkinson

LGTM; thanks for fixing!

fix block_size

98d9174

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

heheda12345 requested review from LucasWilkinson, WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat and ywang96 as code owners October 10, 2025 03:40

mergify bot added the deepseek (Related to DeepSeek models) and v1 labels Oct 10, 2025

gemini-code-assist bot reviewed Oct 10, 2025

heheda12345 mentioned this pull request Oct 10, 2025

Tracking Issue: DeepSeek V3.2 support #25877

Open

heheda12345 added the ready (ONLY add when PR is ready to merge/full CI is needed) label Oct 10, 2025

LucasWilkinson approved these changes Oct 10, 2025

heheda12345 merged commit 6f0f570 into vllm-project:main Oct 10, 2025
53 of 54 checks passed

heheda12345 deleted the fix_ds32_block_size branch October 10, 2025 08:40

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

5c7ff52

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

heheda12345 mentioned this pull request Oct 10, 2025

[Bug]: prepare_kernel_block_sizes doesn't parse UniformTypeKVCacheSpecs #26524

Closed

1 task

Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

3a64cc7

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: Dhruvil Bhatt <bhattdbh@amazon.com>

bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

aaa3783

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: bbartels <benjamin@bartels.dev>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

1e8f885

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

0ccaf77

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[deepseek] kernel block size for UniformTypeKVCacheSpecs (vllm-projec…

4e1eb6d

…t#26559) Signed-off-by: Chen Zhang <zhangch99@outlook.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
