Conversation

@linfeng-yuan
Collaborator

@linfeng-yuan linfeng-yuan commented Jun 20, 2025

What this PR does / why we need it?

Fix the insufficient length of the cached cosine and sine tables in MLA's TorchAir graph mode, which caused accuracy deviation during long-sequence inference.

Backported: #1331
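A minimal sketch of the bug class this PR addresses, assuming a standard RoPE-style cos/sin cache (the function and parameter names here are hypothetical, not the PR's actual code): if the cache is precomputed for fewer positions than the longest sequence served, lookups past its end read wrong or clipped values, which surfaces as accuracy loss only on long sequences.

```python
import torch

def build_rope_cache(max_positions: int, head_dim: int, base: float = 10000.0):
    """Precompute cos/sin tables covering every position up to max_positions.

    The fix amounts to sizing max_positions to the model's true maximum
    sequence length rather than a shorter default, so graph-mode lookups
    never index past the cached tables.
    """
    # Standard RoPE inverse frequencies over the (even) head dimensions.
    inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2).float() / head_dim))
    t = torch.arange(max_positions).float()
    freqs = torch.outer(t, inv_freq)  # shape: [max_positions, head_dim // 2]
    return freqs.cos(), freqs.sin()

# Cache sized to the full context length, not a shorter default.
cos, sin = build_rope_cache(max_positions=4096, head_dim=64)
```

In graph mode the table sizes are baked in at capture time, so an undersized cache cannot grow lazily on a long request; it has to be allocated for the maximum length up front.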

Does this PR introduce any user-facing change?

No.

How was this patch tested?

We tested the accuracy of this patch with a DeepSeek R1 end-to-end benchmark serving run and got a score of 83.33 on the AIME2024 dataset with the DP4TP4EP16 setting.

… long sequence scenarios

Signed-off-by: linfeng-yuan <1102311262@qq.com>
@wangxiyuan wangxiyuan merged commit a3a3d38 into vllm-project:v0.9.1-dev Jun 21, 2025
16 checks passed