is_linear fix for MHA #1141

HDCharles · 2024-10-22T20:12:43Z

Summary: the filter fn may need access to the parent types of a module, as is the case with MHA

Test Plan: python /home/cdhernandez/local/ao/test/integration/test_integration.py -k "test_autoquant_mha"

pytorch-bot · 2024-10-22T20:12:48Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1141

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4f45e03 with merge base 4b563f2:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 · 2024-10-23T00:13:39Z

torchao/quantization/quant_api.py

@@ -234,6 +234,7 @@ def _is_linear(mod, *args):
        and not isinstance(mod.weight, AffineQuantizedTensor)
        and not isinstance(mod.weight, LinearActivationQuantizedTensor)
        and not isinstance(mod.weight, AffineFakeQuantizedTensor)
+        and not isinstance(mod, nn.modules.linear.NonDynamicallyQuantizableLinear)


nit: maybe add a Note here, to say that if MHA is refactored to not use this module we'd need a different solution
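
For readers following along, the effect of the added check can be sketched without PyTorch. In PyTorch, `NonDynamicallyQuantizableLinear` subclasses `nn.Linear` and is the type of `nn.MultiheadAttention`'s `out_proj`, so a plain `isinstance(mod, nn.Linear)` test also matches it. The classes and the `is_linear_sketch` function below are hypothetical stand-ins for illustration only, not the torchao implementation, and the tensor-subclass checks from the real `_is_linear` are omitted.

```python
# Stand-in classes (hypothetical) that mirror the relationship between
# nn.Linear and nn.modules.linear.NonDynamicallyQuantizableLinear.
class Linear:
    def __init__(self, in_features, out_features):
        self.in_features = in_features
        self.out_features = out_features
        # Placeholder weight; a real module would hold a tensor here.
        self.weight = [[0.0] * in_features for _ in range(out_features)]


class NonDynamicallyQuantizableLinear(Linear):
    """Stand-in for the Linear subclass used as MHA's out_proj."""


def is_linear_sketch(mod, *args):
    # Note: this exclusion relies on MHA using the subclass above.
    # If MHA is refactored to not use it, a different solution is needed.
    return (
        isinstance(mod, Linear)
        and hasattr(mod, "weight")
        and not isinstance(mod, NonDynamicallyQuantizableLinear)
    )


plain = Linear(4, 4)
mha_out_proj = NonDynamicallyQuantizableLinear(4, 4)

print(is_linear_sketch(plain))         # True: ordinary linear layers still pass
print(is_linear_sketch(mha_out_proj))  # False: MHA's out_proj is filtered out
```

The subclass check works because `isinstance` follows inheritance, so excluding the subclass after accepting the base class removes only the MHA projection.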

jerryzh168

looks good, thanks!

jerryzh168 · 2024-10-23T21:59:46Z

test/integration/test_integration.py

@@ -1278,6 +1278,32 @@ def test_autoquant_compile(self, device, dtype, m1, m2, k, n):
        sqnr = SQNR(out, out2)
        self.assertTrue(sqnr >= 30)

+    @parameterized.expand(COMMON_DEVICE_DTYPE)
+    @unittest.skipIf(not TORCH_VERSION_AT_LEAST_2_5, "autoquant requires 2.5+.")


According to CI, this probably needs a skip when CUDA is not available, or change this

ao/test/integration/test_integration.py, line 97 in 629aee1:

COMMON_DEVICES = ["cpu", "cuda"]

to not include cuda when it's not available.
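
Both options mentioned in this comment can be sketched roughly as follows. This is a minimal illustration, not the change that landed in the PR: `CUDA_AVAILABLE` and `ExampleTest` are hypothetical names, and the `try`/`except` around the import exists only so the sketch also runs where torch is not installed.

```python
import unittest

# Assumption: torch may or may not be installed where this sketch runs.
try:
    import torch
    CUDA_AVAILABLE = torch.cuda.is_available()
except ImportError:
    CUDA_AVAILABLE = False

# Option 1: only list "cuda" as a device when it can actually be used.
COMMON_DEVICES = ["cpu"] + (["cuda"] if CUDA_AVAILABLE else [])


class ExampleTest(unittest.TestCase):
    # Option 2: keep the device list as-is and skip CUDA-only tests instead.
    @unittest.skipIf(not CUDA_AVAILABLE, "Need CUDA available")
    def test_needs_cuda(self):
        self.assertTrue(CUDA_AVAILABLE)
```

Option 1 affects every test parameterized over the device list, while option 2 keeps the decision local to the individual test.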

facebook-github-bot added the CLA Signed label Oct 22, 2024

HDCharles changed the title ~~[wip] fix for mha~~ is_linear fix for MHA Oct 22, 2024

HDCharles requested review from jerryzh168 and IvanKobzarev October 22, 2024 20:55

jerryzh168 reviewed Oct 23, 2024

jerryzh168 approved these changes Oct 23, 2024

jerryzh168 mentioned this pull request Oct 23, 2024

[Bug] ERR: subclass doesn't implement <function multi_head_attention_forward> #1103

Closed

jerryzh168 reviewed Oct 23, 2024

HDCharles added 4 commits October 25, 2024 10:52

[wip] fix for mha

83a6b7f

Summary: filter fn may need access to parent types of module as in the case with mha

Test Plan: TODO

testing

47239d8

fixing dtype and device in test

d3af4d5

fixing test

4f45e03

HDCharles force-pushed the 047_fix_mha branch from 2b08079 to 4f45e03 Compare October 25, 2024 17:54

HDCharles merged commit fec5420 into main Oct 25, 2024
17 checks passed
