[BUGFIX][0.9.1] FIX ring_mla input ‘query_lens’ to cpu #2170

JC-ut0 · 2025-08-01T07:53:36Z

What this PR does / why we need it?

[BUGFIX][0.9.1] FIX ring_mla input ‘query_lens’ to cpu

Does this PR introduce any user-facing change?

How was this patch tested?

Signed-off-by: xuyexiong <xuyexiong@huawei.com>

…nto qwen30-dev

* 'qwen30-dev' of https://github.com/rjg-lyh/vllm-ascend:
  [V0.9.1] Replace FA ops with FA_V2 to optimize perf
  [0.9.1]remove chunked_prefill_for_mla (vllm-project#2177)
  move with_prefill allreduce from cpu to npu (vllm-project#2230)
  [v0.9.1] Add release note for v0.9.1rc2 (vllm-project#2233)
  [Docs] Sync main doc to v0.9.1-dev (vllm-project#2227)
  [0.9.1] Enable external distributed dp deployments in vllm ascend(0.9.1 only) (vllm-project#2109)
  [V0.9.1][BugFix] Fix the bug in decoraotor patch (vllm-project#2199)
  [v0.9.1][Bugfix][PD] Auto-clear producer KV cache if no pull notification (vllm-project#2085)
  [BUGFIX][0.9.1] FIX ring_mla input ‘query_lens’ to cpu (vllm-project#2170)
  [0.9.1][Prefill Perf] add D2H & initRoutingQuantV2 (vllm-project#2038)
  [bugfix] add with_prefill cpu allreduce to handle D-node recomputatio… (vllm-project#2129)

[BUGFIX][0.9.1] FIX ring_mla input ‘query_lens’ to cpu

66eeeef

Signed-off-by: xuyexiong <xuyexiong@huawei.com>

ganyi1996ppo approved these changes Aug 1, 2025

ganyi1996ppo merged commit 741a8cf into vllm-project:v0.9.1-dev Aug 1, 2025
16 checks passed
