[Bugfix][V1] Fix deepseek with v1 #958

MengqingCao · 2025-05-26T08:35:31Z

What this PR does / why we need it?

Fix deepseek with v1, this error is introdeced by #945. and this pr fix the block table of mla

How was this patch tested?

CI passed with new addedtest.

MengqingCao · 2025-05-28T09:56:30Z

@wangxiyuan This pr fixes the broken ci on v1+deepseek, now it is ready for review

Signed-off-by: Mengqing Cao <cmq0113@163.com>

Yikun · 2025-05-28T16:21:56Z

FAILED tests/singlecard/test_offline_inference.py::test_models[5-half-Qwen/Qwen2.5-0.5B-Instruct] - pydantic_core._pydantic_core.ValidationError: 2 validation errors for DeviceConfig
device.literal['auto','cuda','neuron','cpu','tpu','xpu','hpu']
  Input should be 'auto', 'cuda', 'neuron', 'cpu', 'tpu', 'xpu' or 'hpu' [type=literal_error, input_value='npu', input_type=str]

Unrelated CI failed, caused by vllm-project/vllm@4c2b38c and try to fix it on: vllm-project/vllm#18843

… main * 'main' of https://github.com/raindaywhu/vllm-ascend: [aclgraph] implentment NPUPiecewiseBackend to enable aclgraph (vllm-project#836) [Bugfix][V1] Fix deepseek with v1 (vllm-project#958) [Perf] Refactor tensor disposal logic to reduce memory usage (vllm-project#966)

### What this PR does / why we need it? Fix deepseek with v1, this error is introdeced by vllm-project#945. and this pr fix the block table of mla ### How was this patch tested? CI passed with new addedtest. Signed-off-by: Mengqing Cao <cmq0113@163.com>

github-actions bot added the module:tests label May 26, 2025

MengqingCao force-pushed the fix branch from 7584bd3 to 1991d8e Compare May 28, 2025 01:02

github-actions bot added the module:tools label May 28, 2025

MengqingCao force-pushed the fix branch from 1991d8e to 7618668 Compare May 28, 2025 01:03

github-actions bot removed the module:tools label May 28, 2025

wangxiyuan approved these changes May 28, 2025

View reviewed changes

[Bugfix][V1] Fix deepseek with v1

7fd38bf

Signed-off-by: Mengqing Cao <cmq0113@163.com>

MengqingCao force-pushed the fix branch from 025e915 to 7fd38bf Compare May 28, 2025 14:36

Yikun approved these changes May 28, 2025

View reviewed changes

wangxiyuan merged commit cc74b97 into vllm-project:main May 29, 2025
17 of 22 checks passed

MengqingCao deleted the fix branch May 30, 2025 02:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix][V1] Fix deepseek with v1 #958

[Bugfix][V1] Fix deepseek with v1 #958

Uh oh!

MengqingCao commented May 26, 2025 •

edited

Loading

Uh oh!

MengqingCao commented May 28, 2025

Uh oh!

Yikun commented May 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Bugfix][V1] Fix deepseek with v1 #958

[Bugfix][V1] Fix deepseek with v1 #958

Uh oh!

Conversation

MengqingCao commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

How was this patch tested?

Uh oh!

MengqingCao commented May 28, 2025

Uh oh!

Yikun commented May 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MengqingCao commented May 26, 2025 •

edited

Loading