Conversation

@Angazenn
Collaborator

What this PR does / why we need it?

Currently, the MLA V1 implementation pads q, k, and v to a `head_dim` of 256 to conform to the early MLA kernel. The new MLA kernel supports head dimensions that are not divisible by 128, so we can remove those unnecessary paddings to improve performance.
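To make the removed work concrete, here is a minimal, hypothetical NumPy sketch (illustrative names only, not the actual vllm-ascend code) of the kind of zero-padding the old path performed before calling the early kernel:

```python
import numpy as np

def pad_to_head_dim(x: np.ndarray, target: int = 256) -> np.ndarray:
    """Zero-pad the last (head_dim) axis up to `target`.

    Hypothetical stand-in for the padding the old MLA V1 path applied
    so the early kernel saw a head_dim of 256.
    """
    pad = target - x.shape[-1]
    if pad <= 0:
        return x
    widths = [(0, 0)] * (x.ndim - 1) + [(0, pad)]
    return np.pad(x, widths)

# A DeepSeek-style MLA query head_dim of 192 (128 nope + 64 rope) is not
# a multiple of 128, so the old path padded it up to 256.
q = np.ones((4, 8, 192), dtype=np.float16)  # (tokens, heads, head_dim)
padded = pad_to_head_dim(q)
print(padded.shape)  # (4, 8, 256)
```

With a kernel that accepts `head_dim` 192 directly, the pad (and the extra memory traffic on the zero tail) can simply be skipped.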

Does this PR introduce any user-facing change?

No.

How was this patch tested?

@Angazenn Angazenn changed the title [WIP][Perf]remove unnecessary padding before mla prefill v1 [WIP][Perf]remove unnecessary padding before MLA V1 prefill May 21, 2025
Signed-off-by: angazenn <zengyanjia@huawei.com>
@ganyi1996ppo ganyi1996ppo merged commit a970b27 into vllm-project:main May 23, 2025
15 checks passed
@ganyi1996ppo ganyi1996ppo changed the title [WIP][Perf]remove unnecessary padding before MLA V1 prefill [Perf]remove unnecessary padding before MLA V1 prefill May 23, 2025
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request May 30, 2025
@Yikun Yikun mentioned this pull request Jun 28, 2025
@Angazenn Angazenn deleted the unpad branch September 8, 2025 03:16
chopper0126 pushed a commit to chopper0126/vllm-ascend that referenced this pull request Oct 16, 2025
Angazenn added a commit to Angazenn/vllm-ascend that referenced this pull request Oct 21, 2025