[Reland][CPU] Support int8 scaled embedding bag #3060
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3060
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 02a1bc4 with merge base 0d3217d.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from ac75d21 to e846df7.
}

} // namespace torchao
unnecessary change?
CC @mingfeima for review. Thanks.
# Next step: support more out_dtype
out_dtype = torch.float32
should this arg be exposed to the op as well in the future?
> should this arg be exposed to the op as well in the future?
Yes
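For reference, a hedged sketch of what exposing out_dtype might look like at the op-registration level (the schema string, argument list, and default are assumptions for illustration, not this PR's actual code):

```cpp
#include <torch/library.h>

// Hypothetical future schema: out_dtype becomes a caller-visible argument
// (defaulting to None, i.e. float32 inside the kernel) instead of being
// hard-coded. All argument names here are illustrative.
TORCH_LIBRARY_FRAGMENT(torchao, m) {
  m.def(
      "_scaled_embedding_bag(Tensor qweight, Tensor indices, Tensor offsets, "
      "Tensor w_scales, float o_scale, int mode, bool include_last_offset, "
      "ScalarType? out_dtype=None) -> Tensor");
}
```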
stamping, please make sure CI passes
Hi @mingfeima, could you please review this PR? Thanks.
#endif

- template <typename index_t>
+ template <typename index_t, typename data_t>
This is not mandatory, but you could use block_dim as a template argument and make this function simpler:
template <typename index_t, typename scalar_t, int block_dim>
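A minimal sketch of that suggestion, assuming a hypothetical accumulation helper (names and signature are illustrative, not the PR's actual kernel):

```cpp
#include <cstdint>

namespace torchao {

// Hypothetical illustration: with block_dim as a compile-time template
// argument, the inner loop bound is a constant, so the compiler can unroll
// and vectorize it, and the runtime block-size parameter disappears.
template <typename index_t, typename scalar_t, int block_dim>
void accumulate_row(
    const scalar_t* __restrict__ src,
    float* __restrict__ acc,
    index_t row) {
  const scalar_t* p = src + static_cast<int64_t>(row) * block_dim;
  for (int d = 0; d < block_dim; ++d) {
    acc[d] += static_cast<float>(p[d]);
  }
}

} // namespace torchao
```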
One more thing: it would be better to refactor the fp8 conversion SIMD code together with https://github.com/pytorch/ao/blob/main/torchao/csrc/cpu/aten_kernels/float8_linear.cpp, maybe putting both into a shared vec util header.
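A hedged sketch of what that shared utility might look like (the header name, namespace placement, and function are assumptions; the real per-ISA SIMD code lives in float8_linear.cpp and would replace the scalar fallback shown here):

```cpp
// vec_util.h (hypothetical shared header)
#pragma once

#include <cstdint>
#include <c10/util/Float8_e4m3fn.h>

namespace torchao {

// Convert a contiguous run of fp8 (e4m3fn) values to float. Only a scalar
// fallback is sketched; the vectorized conversion paths currently in
// float8_linear.cpp could be specialized behind this one entry point so the
// embedding-bag and linear kernels share a single implementation.
inline void fp8_to_float(
    const c10::Float8_e4m3fn* src, float* dst, int64_t n) {
  for (int64_t i = 0; i < n; ++i) {
    dst[i] = static_cast<float>(src[i]);
  }
}

} // namespace torchao
```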
int8 scaled_embedding_bag was reverted by #2974.
On #2972, the reported failure was caused by an unused qtype variable.
This PR re-enables int8 scaled_embedding_bag and fixes the unused-variable issue.
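As a minimal sketch of the failure class being fixed (illustrative, not the actual diff): under builds that promote -Wunused-variable to an error, a local that is computed but never read fails to compile, and the fix is to either use the value or drop the declaration.

```cpp
#include <ATen/core/Tensor.h>
#include <c10/util/Exception.h>

// Before (hypothetical): fails with -Werror=unused-variable.
//   auto qtype = weight.scalar_type();  // declared but never read
//
// After: fold the value into a check so it is actually used...
void check_weight_dtype(const at::Tensor& weight) {
  TORCH_CHECK(
      weight.scalar_type() == at::kChar,
      "scaled_embedding_bag: expected int8 (Char) weight");
}
// ...or simply delete the declaration if nothing needs it.
```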