Update QAT README.md #2162


Merged · 1 commit · May 2, 2025

Conversation

SalmanMohammadi (Contributor, PR author) commented:

Closes #2155.


pytorch-bot bot commented May 2, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2162

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on May 2, 2025. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)
@jerryzh168 merged commit 4850998 into pytorch:main on May 2, 2025
6 of 18 checks passed
@jerryzh168 (Contributor) commented:

cc @andrewor14

Review thread on the README example (suggested change to the embedding filter_fn):

    quantize_(
        m,
        IntXQuantizationAwareTrainingConfig(weight_config=weight_config),
    -   filter_fn=lambda m, _: isinstance(m, torch.nn.Embedding),
    +   filter_fn=lambda m, _: isinstance(m, torch.nn.Embedding) or _is_linear(m),
    )
Contributor:

Actually this is only if you want to use the same configuration for embedding and linear. I kept them as two separate calls because in the above example linear additionally has activation quantization.

SalmanMohammadi (Contributor, author):

Yeah, you're right, and this example won't work if you try to apply a config with activation quantization to both linear and embedding layers at the same time.
You can stack calls to quantize_, right? Would the right way to go about this be two quantize_ calls, one which filters for linear, then another which filters for embeddings?

Contributor:

Yeah, if you have slightly different quantization configurations for embedding and linear, the right way would be two separate quantize_ calls. This is by design: we don't want to complicate quantize_ to accept a dictionary of configs.
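For concreteness, here is a minimal sketch of that pattern: two stacked quantize_ calls, each with its own config and filter_fn. It assumes the FakeQuantizeConfig / IntXQuantizationAwareTrainingConfig API shown in the QAT README; the toy model, the int8/int4 settings, and the explicit isinstance filters are illustrative choices, not an excerpt from the README.

```python
import torch

from torchao.quantization import quantize_
from torchao.quantization.qat import (
    FakeQuantizeConfig,
    IntXQuantizationAwareTrainingConfig,
)

# Toy model standing in for `m` (illustrative only).
m = torch.nn.Sequential(
    torch.nn.Embedding(1024, 64),
    torch.nn.Linear(64, 64),
)

# Fake-quantization settings as in the README example:
# int8 dynamic per-token activations, int4 per-group weights.
activation_config = FakeQuantizeConfig(torch.int8, "per_token", is_symmetric=False)
weight_config = FakeQuantizeConfig(torch.int4, group_size=32)

# Call 1: linear layers get activation + weight fake quantization.
quantize_(
    m,
    IntXQuantizationAwareTrainingConfig(activation_config, weight_config),
    filter_fn=lambda mod, fqn: isinstance(mod, torch.nn.Linear),
)

# Call 2: embedding layers get weight-only fake quantization
# (no activation config, since activation fake quantization is not
# supported for embedding layers).
quantize_(
    m,
    IntXQuantizationAwareTrainingConfig(weight_config=weight_config),
    filter_fn=lambda mod, fqn: isinstance(mod, torch.nn.Embedding),
)
```

Since quantize_ modifies the model in place, the two calls compose: each one only touches the modules its filter_fn matches.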

Labels: CLA Signed
Successfully merging this pull request may close: QAT docs (#2155)
4 participants