[bnb] Fix bnb skip modules #24043
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thanks for fixing!
)
self.assertTrue(isinstance(seq_classification_model.classifier.dense, nn.Linear))
self.assertTrue(isinstance(seq_classification_model.classifier.out_proj, nn.Linear))
We should also check that at least one other layer not in llm_int8_skip_modules is loaded in 8-bit, ideally one which will effectively exercise the recursion logic.
Awesome yes agreed! Will add that now
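For context, a check along those lines might look like the following sketch; it assumes the bitsandbytes Linear8bitLt class and the roberta-large-mnli module layout, and the exact module paths are illustrative:

import torch
import bitsandbytes as bnb

# A module listed in llm_int8_skip_modules should stay a plain nn.Linear ...
self.assertTrue(isinstance(seq_classification_model.classifier.dense, torch.nn.Linear))
self.assertFalse(isinstance(seq_classification_model.classifier.dense, bnb.nn.Linear8bitLt))

# ... while a nested layer outside the skip list should have been converted to
# Linear8bitLt, which exercises the recursive replacement logic.
self.assertTrue(
    isinstance(
        seq_classification_model.roberta.encoder.layer[0].attention.self.query,
        bnb.nn.Linear8bitLt,
    )
)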
seq_classification_model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-large-mnli", quantization_config=quantization_config
)
self.assertTrue(isinstance(seq_classification_model.classifier.dense, nn.Linear))
Just for my own understanding (not a comment to address): here we're checking that the layers of the classifier are nn.Linear. In test_linear_are_8bit, we check that the layers are nn.Linear too and that their dtype is torch.int8 (I didn't know this was possible!). Are we certain that this means these layers are loaded in correctly? Do we need a dtype check on the weights?
You are right, we also need a dtype check on the weights! Linear8bitLt has nn.Linear as a superclass. Adding new tests!
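A minimal sketch of why this matters, assuming bitsandbytes is installed (module paths below are illustrative): since Linear8bitLt inherits from nn.Linear, an isinstance check alone cannot tell quantized and non-quantized layers apart, so the weight dtype has to be asserted as well.

import torch
import bitsandbytes as bnb

# Linear8bitLt subclasses nn.Linear, so isinstance(..., nn.Linear) passes for
# quantized and non-quantized layers alike.
assert issubclass(bnb.nn.Linear8bitLt, torch.nn.Linear)

# Hence the test should also check the weight dtype, e.g.:
# self.assertEqual(model.roberta.encoder.layer[0].attention.self.query.weight.dtype, torch.int8)
# self.assertNotEqual(model.classifier.dense.weight.dtype, torch.int8)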
* fix skip modules test
* oops
* address comments
What does this PR do?
Fixes #24037
#23479 mistakenly removed the logic introduced in #21579 for handling modules that should not be converted.
The PR also adds a test to make sure this never happens again.
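For reference, a usage sketch of the skip-module path this fix restores, driven by BitsAndBytesConfig's llm_int8_skip_modules argument (checkpoint and module names are illustrative, mirroring the test above):

from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig

# Quantize the model to 8-bit but keep the classification head in higher precision.
quantization_config = BitsAndBytesConfig(
    load_in_8bit=True,
    llm_int8_skip_modules=["classifier"],
)

model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-large-mnli",
    quantization_config=quantization_config,
)

# Modules listed in llm_int8_skip_modules remain regular nn.Linear layers.
print(type(model.classifier.dense))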