[GPTQ] Fix test #28018
Conversation
Thanks for fixing!
Do you know why this breaking change wasn't caught on the optimum side?
Can you explain a bit more how these changes fix the problem? The changes look OK but it's not clear how they link to the issue description.
# we need to put it directly to the gpu. Otherwise, we won't be able to initialize the exllama kernel
quantized_model_from_saved = AutoModelForCausalLM.from_pretrained(
    tmpdirname, quantization_config=GPTQConfig(use_exllama=True, bits=4), device_map={"": 0}
)
self.assertEqual(quantized_model_from_saved.config.quantization_config.use_exllama, True)
Why remove this line?
With this PR, we no longer save all the arguments (only those in self.serialization_keys), by modifying to_dict(). The issue with that is that we update the quantization_config based on the one from optimum: config.quantization_config = GPTQConfig.from_dict_optimum(quantizer.to_dict()).
This line was needed since some args, like use_exllama, could change in optimum.
I was thinking of opening a PR to remove this line, and maybe to stop saving args related to inference (use_exllama, ...), or to revert the PR on optimum. What are your thoughts?
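To make the mechanism concrete, here is a minimal sketch (hypothetical names, not the actual optimum GPTQQuantizer code) of how whitelisting serialized keys in to_dict() drops runtime-only arguments such as use_exllama:

```python
# Illustrative sketch only -- not the actual optimum implementation.
class GPTQQuantizerSketch:
    # Hypothetical whitelist of attributes that get persisted.
    serialization_keys = ["bits", "group_size", "desc_act", "sym"]

    def __init__(self, bits=4, group_size=128, desc_act=False, sym=True, use_exllama=True):
        self.bits = bits
        self.group_size = group_size
        self.desc_act = desc_act
        self.sym = sym
        self.use_exllama = use_exllama  # inference-only setting, not in the whitelist

    def to_dict(self):
        # Only whitelisted keys end up in the saved config, so use_exllama
        # never reaches the serialized quantization_config.
        return {key: getattr(self, key) for key in self.serialization_keys}


quantizer = GPTQQuantizerSketch(use_exllama=False)
print(quantizer.to_dict())  # {'bits': 4, 'group_size': 128, 'desc_act': False, 'sym': True}
```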
This line was needed since some args, like use_exllama, could change in optimum.
Sorry, I don't completely follow. Does this mean that use_exllama will no longer change and the test check is no longer required?
I was thinking of opening a PR to remove this line, and maybe to stop saving args related to inference (use_exllama, ...)
It depends. This can be considered a breaking change, as users might now expect these values in their configs. The most important thing is for old configs to still be loadable and produce the same result.
Sorry, I don't completely follow. Does this mean that use_exllama will no longer change and the test check is no longer required?
Basically, I mean that the user can set use_exllama=True in transformers and this value can change in optimum (use_exllama=False). However, since we don't serialize it anymore in the optimum GPTQ config, use_exllama will be set to the default value through: config.quantization_config = GPTQConfig.from_dict_optimum(quantizer.to_dict()).
It depends. This can be considered a breaking change, as users might now expect these values in their configs. The most important thing is for old configs to still be loadable and produce the same result.
Yes, the old configs will still work. However, new users will have to pass these args each time.
I will probably work on the second option then, since we should not have had the user select the kernel in the first place, given that we can switch from one kernel to another.
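For illustration, here is a minimal sketch (hypothetical class, not the actual GPTQConfig implementation) of the fallback described above: when a key such as use_exllama is missing from the serialized dict, rebuilding the config reverts it to the constructor default, so a runtime value of use_exllama=False is lost on reload:

```python
from dataclasses import dataclass


# Illustrative stand-in for rebuilding a config from a serialized dict.
@dataclass
class GPTQConfigSketch:
    bits: int = 4
    use_exllama: bool = True  # constructor default

    @classmethod
    def from_dict(cls, config_dict):
        # Keys absent from the dict (e.g. use_exllama, no longer serialized
        # by optimum) silently fall back to the defaults above.
        known = {k: v for k, v in config_dict.items() if k in cls.__dataclass_fields__}
        return cls(**known)


saved = {"bits": 4}  # use_exllama=False was set at runtime but never saved
restored = GPTQConfigSketch.from_dict(saved)
print(restored.use_exllama)  # True -- reverted to the default
```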
@@ -242,12 +244,11 @@ def test_change_loading_attributes(self):
        with tempfile.TemporaryDirectory() as tmpdirname:
            self.quantized_model.save_pretrained(tmpdirname)
            if not self.use_exllama:
                self.assertEqual(self.quantized_model.config.quantization_config.use_exllama, False)
Why remove this line?
see above
Thanks for fixing the tests and explaining!
I'm approving as this is effectively just fixing the tests: the changes to the GPTQ config have already been made and we haven't experienced lots of users complaining. However, making changes like this can be breaking and cause a lot of issues downstream. Because of the coupling between optimum and the respective configs in transformers, I'd expect changes to the API to consider backwards compatibility, and sufficient tests to be run so that changes in optimum that affect transformers are caught before they're merged in.
@SunMarc Do you have permissions to merge? If not, I can merge this in if it's good to go
I'll merge it! Thanks for the reminder
* fix test
* reduce length
* smaller model
What does this PR do?
This PR fixes failing tests related to GPTQ quantization. The breaking tests are caused by a modification on the optimum side and by OOMs from the new runner. I've also switched to a smaller model. Related optimum PR