Remove is_gpt_fast flag #172

jerryzh168 · 2024-04-24T21:35:26Z

Summary:
The `is_gpt_fast` flag was originally added so that the 8da4w and int4 weight-only quantizers could share one code path. We later duplicated the quantizer code instead, so the flag can now be safely removed.

In the future we'll refactor everything to use tensor subclasses.

Test Plan:
Tested locally to make sure `test_8da4w_quantizer_eval` still works.



msaroufim · 2024-04-24T21:51:22Z

test/quantization/test_quant_api.py

@@ -268,57 +268,6 @@ def test_8da4w_quantizer_eval(self):
            f"accuracy regressed from 8.23 to {result['results']['wikitext']['word_perplexity,none']}"
        )

-    @unittest.skip("skipping until we get checkpoints for gpt-fast")
-    def test_gptq_quantizer_gpt_fast(self):


Is this test not useful to keep around, perhaps in some newer version? Or, more generally, can gpt-fast and ao no longer be used together?

Oh, the next test, `test_gptq_quantizer_int4wo`, covers the gpt-fast code path. This was initially added because we were trying to merge the gpt-fast code path and the 8da4w code path into a single quantizer code path, with a flag to distinguish them, but we have since just duplicated the quantizer code.

Since we removed the gpt-fast code path from `Int8DynActInt4WeightGPTQQuantizer`, we no longer need to test this.
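To make the trade-off described above concrete, here is a rough sketch of the two shapes the code can take: one class with a flag, versus two separate classes. All class and method names below are hypothetical and chosen only for illustration; they are not the actual torchao quantizers or their APIs.

```python
# Hypothetical illustration only: these classes are NOT torchao's real quantizers.


class FlaggedQuantizerSketch:
    """Before: one class, with a boolean flag selecting between two code paths."""

    def __init__(self, is_gpt_fast: bool = False):
        self.is_gpt_fast = is_gpt_fast

    def describe(self) -> str:
        # Every method that differs between the two paths needs a branch like this.
        if self.is_gpt_fast:
            return "int4 weight-only"
        return "8da4w"


class Int4WeightOnlySketch:
    """After: the int4 weight-only path lives in its own class."""

    def describe(self) -> str:
        return "int4 weight-only"


class Int8DynActInt4WeightSketch:
    """After: the 8da4w path has no flag and no gpt-fast branch."""

    def describe(self) -> str:
        return "8da4w"
```

Once each path has its own class, the flag and any test that exercises the flagged branch have nothing left to cover, which is presumably why the skipped test can go away with it.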


facebook-github-bot added the CLA Signed label Apr 24, 2024

jerryzh168 force-pushed the remove-flag branch from 9c22873 to 5e351a6 April 24, 2024 21:41

jerryzh168 requested review from HDCharles and cpuhrsch April 24, 2024 21:43

cpuhrsch approved these changes Apr 24, 2024


Merge branch 'main' into remove-flag

be46441

msaroufim reviewed Apr 24, 2024


msaroufim self-requested a review April 24, 2024 22:05

msaroufim approved these changes Apr 24, 2024


Merge branch 'main' into remove-flag

919467a

msaroufim merged commit 7fc5e26 into pytorch:main Apr 24, 2024
13 checks passed
