[Refactor][Kernel] support loading kernel from other place #25823

ILikeIneine · 2025-09-28T06:29:35Z

Purpose

add platform interface to support loading kernel
relate to: #25822

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request refactors kernel loading to be platform-specific by introducing import_general_kernels and import_moe_kernels methods in the Platform interface. The changes in _custom_ops.py now delegate kernel imports to the current platform. The default implementation is provided in interface.py, and TpuPlatform and XPUPlatform provide overrides.

My review identifies a couple of areas for improvement. The error logging in _custom_ops.py should be generalized to avoid confusion. Additionally, for consistency and correctness, the TpuPlatform and XPUPlatform should also override the import_moe_kernels method, similar to how import_general_kernels is handled.

vllm/_custom_ops.py

NickLucche

Clean change thank you @ILikeIneine !
Left a minor comment agreeing with @ProExpertProg .
I am not too enthusiastic with the import_general_kernels function name, but I am also not great with names. What do you think of:
-import_core_kernels
-import_base_kernels
-import_common_kernels

Other than that this is LGTM.

vllm/_custom_ops.py

ILikeIneine · 2025-09-30T03:45:02Z

@NickLucche @ProExpertProg Changes are updated! Still I'm using import_core_kernels, need it being try_import_core_kernels? Though it should stop if import_core_kernels failed. I don't think we should give it a try😂

NickLucche · 2025-09-30T08:55:19Z

Though it should stop if import_core_kernels failed

I agree in principle, I think the try-except guard we have right now it's just for cases where vllm is compiled without a device (eg you just run benchmarks).
Let's wait for @ProExpertProg to ack here.

ProExpertProg

Much cleaner, thanks! Name seems fine to me now

mergify · 2025-10-03T07:03:28Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @ILikeIneine.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

ILikeIneine · 2025-10-05T01:40:19Z

Hi, seems the pr tests are not blocked by this pr, could you take a look and re-trigger the CI? or merge directly? @NickLucche @ProExpertProg

Signed-off-by: Hank <hcc.mayday@gmail.com>

youkaichao · 2025-10-05T11:55:21Z

vllm/_custom_ops.py

-    import vllm._moe_C  # noqa: F401
-    supports_moe_ops = True
+current_platform.import_core_kernels()
+supports_moe_ops = current_platform.try_import_moe_kernels()


this attribute is only used here:

vllm/vllm/_custom_ops.py

Line 1539 in 17edd8a

if supports_moe_ops and hasattr(torch.ops._moe_C, "marlin_gemm_moe"):

I wonder if we can remove supports_moe_ops and just do:

if hasattr(torch.ops, "_moe_C") and hasattr(torch.ops._moe_C, "marlin_gemm_moe"):

then having a simple import_kernels interface for the platform class would sounds better.

cc @NickLucche

@ProExpertProg do you know why we don't use if hasattr(torch.ops, "_moe_C") and hasattr(torch.ops._moe_C, "marlin_gemm_moe"): in the first place? or maybe @tlrmchlsmth @bnellnm have better ideas.

Using hasattr seems reasonable to me.

Yeah ++ that seems simpler

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>

Signed-off-by: Hank <hcc.mayday@gmail.com>

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Signed-off-by: Hank <hcc.mayday@gmail.com>

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Signed-off-by: Hank <hcc.mayday@gmail.com>

ILikeIneine requested review from NickLucche and jikunshang as code owners September 28, 2025 06:29

mergify bot added the tpu Related to Google TPUs label Sep 28, 2025

gemini-code-assist bot reviewed Sep 28, 2025

View reviewed changes

vllm/_custom_ops.py Outdated Show resolved Hide resolved

vllm/_custom_ops.py Outdated Show resolved Hide resolved

ILikeIneine mentioned this pull request Sep 28, 2025

[RFC][Plugin]: support loading kernels from other place #25822

Open

1 task

ProExpertProg reviewed Sep 29, 2025

View reviewed changes

vllm/_custom_ops.py Outdated Show resolved Hide resolved

NickLucche approved these changes Sep 29, 2025

View reviewed changes

vllm/_custom_ops.py Outdated Show resolved Hide resolved

vllm/_custom_ops.py Outdated Show resolved Hide resolved

ILikeIneine requested a review from NickLucche September 30, 2025 06:48

ILikeIneine requested a review from ProExpertProg October 1, 2025 11:25

ProExpertProg approved these changes Oct 1, 2025

View reviewed changes

ProExpertProg enabled auto-merge (squash) October 1, 2025 15:08

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 1, 2025

mergify bot added the needs-rebase label Oct 3, 2025

auto-merge was automatically disabled October 4, 2025 03:54
Head branch was pushed to by a user without write access

mergify bot removed the needs-rebase label Oct 4, 2025

ILikeIneine requested a review from ProExpertProg October 5, 2025 01:40

ILikeIneine added 2 commits October 5, 2025 10:18

feat: support loading kernel from other place

382d698

Signed-off-by: Hank <hcc.mayday@gmail.com>

refactor: adjust the logic of import kernels

0358999

Signed-off-by: Hank <hcc.mayday@gmail.com>

ILikeIneine force-pushed the support-load-kernels-from-plugin branch from dee9eb8 to 0358999 Compare October 5, 2025 02:20

remove warning of missing moe_C kernel

51a3c8c

Signed-off-by: Hank <hcc.mayday@gmail.com>

NickLucche merged commit 17edd8a into vllm-project:main Oct 5, 2025
45 checks passed

youkaichao reviewed Oct 5, 2025

View reviewed changes

tomeras91 pushed a commit to tomeras91/vllm that referenced this pull request Oct 6, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

652a359

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

NickLucche mentioned this pull request Oct 6, 2025

[Kernel] Centralize platform kernel import in current_platform.import_kernels #26286

Merged

karan pushed a commit to karan/vllm that referenced this pull request Oct 6, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

b379e9a

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>

southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

6f52416

Signed-off-by: Hank <hcc.mayday@gmail.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

6e54881

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

ILikeIneine mentioned this pull request Oct 14, 2025

[RFC] Allow OOT to import vllm._C MetaX-MACA/vLLM-metax#17

Closed

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

1bdbdda

Signed-off-by: Hank <hcc.mayday@gmail.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

80e05cc

Signed-off-by: Hank <hcc.mayday@gmail.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

a693f68

Signed-off-by: Hank <hcc.mayday@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025

[Platform][Kernel] platform-specific kernel loading (vllm-project#25823)

75c3603

Signed-off-by: Hank <hcc.mayday@gmail.com>

Uh oh!

[Refactor][Kernel] support loading kernel from other place #25823

[Refactor][Kernel] support loading kernel from other place #25823

Uh oh!

Conversation

ILikeIneine commented Sep 28, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ILikeIneine commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NickLucche commented Sep 30, 2025

Uh oh!

ProExpertProg left a comment

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Oct 3, 2025

Uh oh!

ILikeIneine commented Oct 5, 2025

Uh oh!

Uh oh!

youkaichao Oct 5, 2025

Choose a reason for hiding this comment

Uh oh!

youkaichao Oct 5, 2025

Choose a reason for hiding this comment

Uh oh!

bnellnm Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

ProExpertProg Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ILikeIneine commented Sep 28, 2025 •

edited by github-actions bot

Loading

ILikeIneine commented Sep 30, 2025 •

edited

Loading