[BugFix] Make FP8 Linear compatible with torch.compile #13918

WoosukKwon · 2025-02-26T19:21:23Z

This PR registers the FP8 block linear op for torch.compile.

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

github-actions · 2025-02-26T19:21:34Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

mgoin

@tlrmchlsmth @LucasWilkinson

WoosukKwon · 2025-02-26T19:25:14Z

*This PR is not tested yet. Tested by @chenyang78

ProExpertProg · 2025-09-05T15:03:37Z

Does anyone remember why this was necessary? It's getting in the way of fusion so we want to unwrap it again

[BugFix] Make FP8 Linear compatible with torch.compile

541fe06

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon requested review from mgoin, robertgshaw2-redhat and tlrmchlsmth as code owners February 26, 2025 19:21

mgoin approved these changes Feb 26, 2025

mgoin added the ready label Feb 26, 2025

chenyang78 reviewed Feb 26, 2025

vllm/model_executor/layers/quantization/utils/fp8_utils.py (outdated, resolved)

Fix

f113803

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon merged commit b382a7f into main Feb 26, 2025
12 of 16 checks passed

WoosukKwon deleted the fp8-linear-torch-compile branch February 26, 2025 21:48

Akshat-Tripathi pushed a commit to krai/vllm that referenced this pull request Mar 3, 2025

[BugFix] Make FP8 Linear compatible with torch.compile (vllm-project#…

5eb0d63

…13918) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[BugFix] Make FP8 Linear compatible with torch.compile (vllm-project#…

00be88b

…13918) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025

[BugFix] Make FP8 Linear compatible with torch.compile (vllm-project#…

8418a34

…13918) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
