Use 4x3 tiled shader for linear mat mul which performs slightly better. #15988

trivedivivek · 2025-11-26T06:44:51Z

Summary: This diff optimizes the performance of the quantized linear matrix multiplication operation by using a 4x3 tiled shader, which performs slightly better than the previous implementation.
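For readers unfamiliar with output tiling, here is a minimal CPU-side sketch of the idea in Python. It is not the shader from this diff. It assumes "4x3" means each shader invocation produces a tile of 4 output rows by 3 output columns, which the summary does not confirm. Quantization is left out entirely, and the name `tiled_matmul` is hypothetical.

```python
def tiled_matmul(a, b, tile_m=4, tile_n=3):
    """Multiply a (m x k) by b (k x n), one output tile at a time.

    Assumption: each (tile_m x tile_n) tile stands in for the work of a
    single shader invocation. Inputs are non-empty lists of lists.
    """
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions of a and b do not match")

    out = [[0] * n for _ in range(m)]
    for row0 in range(0, m, tile_m):
        rows = min(tile_m, m - row0)  # partial tile at the bottom edge
        for col0 in range(0, n, tile_n):
            cols = min(tile_n, n - col0)  # partial tile at the right edge
            # Accumulators for this tile; in a shader these would be registers.
            acc = [[0] * cols for _ in range(rows)]
            for kk in range(k):
                # Each loaded value is reused across the whole tile:
                # rows values from a and cols values from b per step.
                a_vals = [a[row0 + i][kk] for i in range(rows)]
                b_vals = [b[kk][col0 + j] for j in range(cols)]
                for i in range(rows):
                    for j in range(cols):
                        acc[i][j] += a_vals[i] * b_vals[j]
            # Write the finished tile back to the output.
            for i in range(rows):
                for j in range(cols):
                    out[row0 + i][col0 + j] = acc[i][j]
    return out
```

Under this assumption, a full tile loads 4 + 3 values per step of the inner dimension and performs 12 multiply-adds with them, which is the usual motivation for tiling. Whether that accounts for the gain reported here is not stated in the PR.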

Differential Revision: D87902847

pytorch-bot · 2025-11-26T06:44:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15988

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 183c8ef with merge base 3ce840c:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-11-26T06:44:58Z

@trivedivivek has exported this pull request. If you are a Meta employee, you can view the originating Diff in D87902847.

…r. (pytorch#15988) Summary: This diff optimizes the performance of the quantized linear matrix multiplication operation by using a 4x3 tiled shader, which performs slightly better than the previous implementation. Differential Revision: D87902847

…r. (pytorch#15988) Summary: This diff optimizes the performance of the quantized linear matrix multiplication operation by using a 4x3 tiled shader, which performs slightly better than the previous implementation. Reviewed By: yipjustin Differential Revision: D87902847

Differential Revision: D87902847 Pull Request resolved: pytorch#15988

trivedivivek requested a review from SS-JIA as a code owner November 26, 2025 06:44

meta-cla bot added the CLA Signed label (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.) Nov 26, 2025

meta-codesync bot added the fb-exported and meta-exported labels Nov 26, 2025

trivedivivek added the release notes: vulkan label (Changes to the Vulkan backend delegate) Nov 26, 2025

yipjustin approved these changes Nov 26, 2025

trivedivivek force-pushed the export-D87902847 branch from 2a2fa9a to ef53422 Compare November 26, 2025 21:16

trivedivivek force-pushed the export-D87902847 branch from ef53422 to 45ea9d8 Compare December 1, 2025 04:20

trivedivivek force-pushed the export-D87902847 branch from 45ea9d8 to d9c2059 Compare December 1, 2025 04:22

trivedivivek force-pushed the export-D87902847 branch from d9c2059 to 1dc0e0d Compare December 1, 2025 20:40

trivedivivek force-pushed the export-D87902847 branch from 1dc0e0d to 183c8ef Compare December 1, 2025 20:47

meta-codesync bot merged commit ccc5eb0 into pytorch:main Dec 2, 2025
144 checks passed

AdrianLundell pushed a commit to AdrianLundell/executorch that referenced this pull request Dec 2, 2025

Use 4x3 tiled shader for linear mat mul which performs slightly better.

9eb5aa9

Differential Revision: D87902847 Pull Request resolved: pytorch#15988
