Conversation

@kzawora-intel (Contributor) commented Sep 8, 2025

Some layers inheriting from CustomOp override the forward method directly, which bypasses the platform routing logic defined in CustomOp.dispatch_forward(): self._forward_method is never called. This is a problem for out-of-tree backends, because providing a backend-specific forward_oot does not result in the proper forward pass being called, even though it is correctly selected in self._forward_method. This PR moves the forward overrides in layers inheriting from CustomOp to forward_native, and adds forward_cuda implementations that also call forward_native. This preserves the current behavior for all existing in-tree backends while restoring the routing logic for out-of-tree backends.
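The dispatch pattern this PR restores can be sketched as below. This is a simplified standalone model, not the actual vLLM implementation; the names (CustomOp, dispatch_forward, forward_native, forward_cuda, forward_oot) mirror vLLM's, but the backend-selection mechanism here is a hypothetical stand-in for vLLM's platform detection.

```python
# Minimal sketch of CustomOp-style forward dispatch, assuming a simple
# string-based backend selector instead of vLLM's platform detection.

class CustomOp:
    def __init__(self, backend: str = "native"):
        # Routing decision is made once, at construction time.
        self._forward_method = self.dispatch_forward(backend)

    def forward(self, *args, **kwargs):
        # Entry point. Subclasses must NOT override this method;
        # doing so bypasses the routing below (the bug this PR fixes).
        return self._forward_method(*args, **kwargs)

    def dispatch_forward(self, backend: str):
        # Pick the backend-specific implementation.
        return {
            "cuda": self.forward_cuda,
            "oot": self.forward_oot,
        }.get(backend, self.forward_native)

    def forward_native(self, x):
        raise NotImplementedError

    def forward_cuda(self, x):
        # Default CUDA path falls back to the native implementation,
        # matching the behavior-preserving change in this PR.
        return self.forward_native(x)

    def forward_oot(self, x):
        # Out-of-tree backends replace this with their own kernel.
        return self.forward_native(x)


class MyLayer(CustomOp):
    # Correct pattern after this PR: implement forward_native
    # instead of overriding forward.
    def forward_native(self, x):
        return x * 2


layer = MyLayer(backend="oot")
result = layer.forward(3)  # routed via forward_oot -> forward_native
```

With the old (broken) pattern, MyLayer would have overridden forward directly, so an out-of-tree backend's forward_oot would never run even when dispatch_forward selected it.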

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
@gemini-code-assist (bot) left a comment

Code Review

This pull request addresses a bug in CustomOp implementations where platform-specific routing logic was being bypassed due to forward method overrides. The fix involves renaming forward methods to forward_native and adding forward_cuda methods that call forward_native, thus preserving routing logic for out-of-tree backends. The review focuses on ensuring the correctness of these changes and identifying any potential issues.

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
@xuechendi (Contributor) commented:

@MengqingCao @simon-mo @youkaichao @DarkLight1337, could you take a quick look at this PR?

@xuechendi (Contributor) commented:

@gshtras, could you take a look at this PR?

@jikunshang (Collaborator) left a comment:

Please fix pre-commit.

@MengqingCao (Contributor) left a comment:

This PR is very clean, LGTM.

@ProExpertProg (Collaborator) left a comment:

Please fix pre-commit

@kzawora-intel (Contributor, Author) commented Sep 11, 2025

@jikunshang @MengqingCao @ProExpertProg pre-commit is green now.

@ProExpertProg ProExpertProg enabled auto-merge (squash) September 11, 2025 14:21
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 11, 2025
@ProExpertProg ProExpertProg merged commit 4aa2389 into vllm-project:main Sep 11, 2025
51 of 53 checks passed
@yma11 mentioned this pull request Sep 12, 2025
skyloevil pushed a commit to skyloevil/vllm that referenced this pull request Sep 13, 2025
dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025
xuechendi added a commit to vllm-project/vllm-gaudi that referenced this pull request Sep 16, 2025
- HPU Mrope implementation had a bug which was exposed by
vllm-project/vllm#24444
- Initial workaround was to use the default implementation:
#162
- This PR fixes the bug in the HPU mrope

---------

Signed-off-by: attafosu <thomas.atta-fosu@intel.com>
Co-authored-by: Chendi.Xue <chendi.xue@intel.com>

slokesha pushed a commit to slokesha/vllm-gaudi that referenced this pull request Sep 24, 2025
(same commit message as above)
Signed-off-by: slokesha <slokeshappa@habana.ai>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
…llm-project#24444)

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…llm-project#24444)

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Labels

ready ONLY add when PR is ready to merge/full CI is needed

5 participants