[Bugfix] Fix MRoPE dispatch on XPU #24724
Conversation
Signed-off-by: Yan Ma <yan.ma@intel.com>
@jikunshang please take a look.
Code Review
This pull request addresses a dispatch issue for MRoPE (Multimodal Rotary Position Embedding) on XPU devices. The PR description mentions CPU, but the change correctly targets XPU. The MRotaryEmbedding class was incorrectly inheriting the forward_xpu method from its RotaryEmbedding base class, whose implementation lacks the logic required for multimodal inputs (where positions.ndim == 2) and would therefore behave incorrectly.
The fix introduces a forward_xpu method in the MRotaryEmbedding class that dispatches to its own forward_native implementation. This is the correct approach, as MRotaryEmbedding.forward_native contains the necessary logic to handle multimodal inputs, ensuring correct functionality on XPU devices. This change aligns the XPU implementation with the existing CPU fallback, providing a correct execution path. The change is sound and resolves the bug.
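For context, here is a minimal, self-contained sketch of the dispatch behavior described above. The class and method names mirror vLLM's rotary embedding layers, but the bodies are illustrative stand-ins rather than the real implementations:

```python
# Sketch of the dispatch bug and the fix; bodies are stand-ins, not vLLM code.
import torch


class RotaryEmbedding:
    def forward_native(self, positions, query, key):
        # Reference path: assumes one position id per token (1-D positions).
        assert positions.ndim == 1
        return query, key

    def forward_xpu(self, positions, query, key):
        # Stand-in for the XPU kernel path, which also assumes 1-D positions.
        assert positions.ndim == 1, "XPU kernel path expects 1-D positions"
        return query, key


class MRotaryEmbedding(RotaryEmbedding):
    def forward_native(self, positions, query, key):
        # MRoPE additionally accepts 2-D positions (one row per rotary
        # section) for multimodal inputs.
        assert positions.ndim in (1, 2)
        return query, key

    # The fix: without this override, XPU dispatch lands in the base class's
    # forward_xpu, which cannot handle 2-D multimodal positions.
    def forward_xpu(self, positions, query, key):
        return self.forward_native(positions, query, key)


rope = MRotaryEmbedding()
q, k = torch.zeros(4, 8), torch.zeros(4, 8)
multimodal_positions = torch.zeros(3, 4, dtype=torch.long)  # 2-D for MRoPE
rope.forward_xpu(multimodal_positions, q, k)  # succeeds only with the override
```

Routing forward_xpu through forward_native mirrors the existing CPU fallback mentioned above.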
```python
        key = torch.cat((key_rot, key_pass), dim=-1).reshape(key_shape)
        return query, key

    def forward_xpu(
```
What will XPU call without this?
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/rotary_embedding/base.py#L122, I think this is the expected path, but there is a tensor mismatch error when calling the kernel, and it is strange that we did not hit this path before. I will look into it further, but we need this fix to unblock Qwen2.5-VL.
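To make the reported tensor mismatch concrete, a small illustration (the shapes here are assumptions chosen for demonstration, e.g. three rows for temporal/height/width sections, not values taken from this PR):

```python
# Illustrative only: plain RoPE positions vs. MRoPE multimodal positions.
import torch

num_tokens = 4

# Plain RoPE: one position id per token.
positions_1d = torch.arange(num_tokens)                      # shape (4,)

# MRoPE multimodal case: one row per rotary section.
positions_2d = torch.zeros(3, num_tokens, dtype=torch.long)  # shape (3, 4)

# A kernel that flattens positions to index a cos/sin cache would see
# 3 * num_tokens entries instead of num_tokens, hence the mismatch.
print(positions_1d.numel(), positions_2d.numel())  # 4 12
```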
Purpose
Fix the MRoPE dispatch issue on XPU introduced in #24444.
Test Plan
```bash
VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 VLLM_WORKER_MULTIPROC_METHOD=spawn python3 examples/offline_inference/basic/generate.py --model Qwen/Qwen2.5-VL-7B-Instruct --enforce-eager
```
Test Result