
Conversation

@Chenyaaang Chenyaaang (Contributor) commented Oct 9, 2025

Fix TPU torch compile error after #26113

This PR includes:

  1. Set the compilation backend to openxla on the TPU platform (see the sketch after this list).
  2. Make sure TPU uses forward_tpu when dispatching custom ops.
  3. Bypass some backend checks that require either eager or inductor, since those checks only apply to non-TPU platforms.
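
For context, the TPU-side change boils down to forcing the openxla Dynamo backend and keeping custom ops enabled inside the TPU platform's config hook. The snippet below is an illustrative sketch only, not the exact diff in this PR (check_and_update_config is the existing hook name in vllm/platforms/tpu.py; the rest is simplified):

    # Illustrative sketch only, not the exact diff in this PR.
    # In vLLM, TpuPlatform derives from the Platform base class.
    class TpuPlatform:

        @classmethod
        def check_and_update_config(cls, vllm_config) -> None:
            compilation_config = vllm_config.compilation_config

            # The default Dynamo backend is inductor; on TPU the graph has to
            # be lowered through openxla instead.
            compilation_config.backend = "openxla"

            # Keep custom ops enabled so dispatch resolves to forward_tpu
            # rather than the torch-native path inductor would trace.
            if (compilation_config.custom_ops.count("none")
                    + compilation_config.custom_ops.count("all") == 0):
                compilation_config.custom_ops.append("all")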

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

I've reviewed the changes and they look good for fixing the TPU compilation issue. The logic to enable custom ops and set the openxla backend for TPU is correct. I have one suggestion to improve the user experience by adding a log message when the backend is overridden, which is consistent with other parts of the code.
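
For illustration, the suggested log line could look roughly like the sketch below. This is hypothetical and not code from the PR; it assumes a standard module-level logger and the surrounding compilation_config variable:

    import logging

    logger = logging.getLogger(__name__)

    # Hypothetical sketch of the suggested log message; not code from this PR.
    logger.info("Overriding compilation backend from %s to openxla on TPU",
                compilation_config.backend)
    compilation_config.backend = "openxla"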

@ProExpertProg ProExpertProg (Collaborator) left a comment

Thanks for catching this; sorry we didn't catch it during review. This got broken because the logic is complicated. Could we improve it?

    compilation_config.backend = "openxla"

Suggested change:

    # Note: the default backend is set to inductor now;
    # we want to overwrite it to openxla to execute the ops properly on TPU.
    compilation_config.backend = "openxla"
Collaborator:

Can you make this a property of current_platform? Perhaps current_platform.default_dynamo_backend?
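
A minimal sketch of that idea, assuming a simple class attribute on the platform interface (default_dynamo_backend is only a proposed name here and does not exist in the codebase):

    # Hypothetical sketch: "default_dynamo_backend" is a proposed name,
    # not an existing attribute in the codebase.
    class Platform:
        default_dynamo_backend: str = "inductor"

    class TpuPlatform(Platform):
        default_dynamo_backend: str = "openxla"

    # The config code could then avoid the hard-coded string:
    # compilation_config.backend = current_platform.default_dynamo_backend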

Comment on lines 326 to 342
    # If user does not set custom ops via none or all set it here based on
    # compilation level and backend.
    if (
        self.compilation_config.custom_ops.count("none")
        + self.compilation_config.custom_ops.count("all")
        == 0
    ):
        from vllm.platforms import current_platform

        if (
            self.compilation_config.level > 0
            and self.compilation_config.backend != "eager"
            and not current_platform.is_tpu()
        ):
            self.compilation_config.custom_ops.append("none")
        else:
            self.compilation_config.custom_ops.append("all")
Collaborator:

Let's just try to move this after the platform-specific update, and TPU can set custom ops the way it wants to.

Contributor (Author):

Yes, we can move the self.compilation_config.custom_ops logic after current_platform.check_and_update_config(self), and the condition would become self.compilation_config.backend not in ["eager", "openxla"]. Does that look better to you?
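
In other words, after the move the common block would read roughly as follows (a sketch of this proposal, not code from the PR; self and current_platform refer to the surrounding config code):

    # Sketch of the proposed ordering and condition; not code from the PR.
    current_platform.check_and_update_config(self)  # may set backend/custom_ops first

    if (
        self.compilation_config.custom_ops.count("none")
        + self.compilation_config.custom_ops.count("all")
        == 0
    ):
        if (
            self.compilation_config.level > 0
            and self.compilation_config.backend not in ["eager", "openxla"]
        ):
            self.compilation_config.custom_ops.append("none")
        else:
            self.compilation_config.custom_ops.append("all")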

Collaborator:

I actually think that tpu platform should just do:

        if (
            self.compilation_config.custom_ops.count("none")
            + self.compilation_config.custom_ops.count("all")
            == 0
        ):
            self.compilation_config.custom_ops.append("all")

And then the common logic won't even be hit because custom_ops already contains "all" (once the logic is moved after).

Contributor (Author):

After some investigation, I'd prefer not to move current_platform.check_and_update_config(self) up before this part. I'm concerned that the logic inside each platform's check_and_update_config may depend on earlier initialization, and since my change only concerns TPU, I don't want to add potential risk for other platforms.

@Chenyaaang Chenyaaang force-pushed the torch-compile-error branch from c1c65d0 to 280b389 on October 9, 2025 00:26
Signed-off-by: Chenyaaang <chenyangli@google.com>
@Chenyaaang Chenyaaang force-pushed the torch-compile-error branch from 280b389 to 144aeb5 on October 9, 2025 00:26
@ProExpertProg (Collaborator):
The original PR was reverted; we can include these fixes in the unrevert.

@Chenyaaang (Contributor, Author):
> The original PR was reverted; we can include these fixes in the unrevert.

Sounds good, I'll go ahead and close this PR. Can you also cc me on the new PR? Thanks!


Labels

tpu: Related to Google TPUs
