[`Docs`] Add 4-bit serialization docs #28182

younesbelkada · 2023-12-21T13:28:55Z

What does this PR do?

Follow up work from: #26037
Adds a few lines to the documentation about serializing 4-bit models and pushing them to the Hub.
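For readers who want to see the workflow the new docs describe (load a model in 4-bit precision, then call `model.push_to_hub()`, which needs `bitsandbytes>=0.41.3`), here is a minimal sketch. The helper names `version_tuple`, `can_serialize_4bit`, and `push_4bit_model` are hypothetical and not part of this PR or of any library. The sketch assumes `transformers`, `accelerate`, and `bitsandbytes` are installed, a CUDA GPU is available, and you are logged in to the Hub.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum bitsandbytes version named in the docs change.
MIN_BITSANDBYTES = (0, 41, 3)


def version_tuple(text):
    """Turn a version string such as '0.41.3.post2' into (0, 41, 3).

    Simplification: only the leading digits of the first three components
    are used, so pre-release tags like 'rc1' are ignored.
    """
    numbers = []
    for piece in text.split(".")[:3]:
        match = re.match(r"\d+", piece)
        numbers.append(int(match.group()) if match else 0)
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def can_serialize_4bit(installed=None):
    """Return True if the bitsandbytes version is at least 0.41.3."""
    if installed is None:
        try:
            installed = version("bitsandbytes")
        except PackageNotFoundError:
            return False
    return version_tuple(installed) >= MIN_BITSANDBYTES


def push_4bit_model(model_id, repo_id):
    """Load `model_id` in 4-bit precision and push it to `repo_id` on the Hub."""
    if not can_serialize_4bit():
        raise RuntimeError("4-bit serialization needs bitsandbytes>=0.41.3")

    # Imported lazily so the version check works without transformers installed.
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig

    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=BitsAndBytesConfig(load_in_4bit=True),
        device_map="auto",  # requires accelerate
    )
    model.push_to_hub(repo_id)
    return model
```

A call might look like `push_4bit_model("bigscience/bloom-560m", "your-username/bloom-560m-4bit")`, where the target repository name is a placeholder.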

HuggingFaceDocBuilderDev · 2023-12-21T13:49:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

stevhliu · 2023-12-21T16:04:30Z

docs/source/en/quantization.md

@@ -468,6 +468,10 @@ Try 4-bit quantization in this [notebook](https://colab.research.google.com/driv

 This section explores some of the specific features of 4-bit models, such as changing the compute data type, using the Normal Float 4 (NF4) data type, and using nested quantization.

+#### Push 4-bit models on the Hub
+
+If you have `bitsandbytes>=0.41.3`, you can serialize 4-bit models and push them on Hugging Face Hub. Simply call `model.push_to_hub()` after loading it in 4-bit precision.


Thanks for the update! I think it would be nicer to include this here so it parallels the 8-bit section :)

younesbelkada · 2023-12-21

Ah, thanks! Good point, will do it now.

younesbelkada · 2023-12-21

Done. Maybe it is a bit repetitive, let me know what you think!

stevhliu · 2023-12-21

Nice, and then we can remove the `#### Push 4-bit models on the Hub` section so it's not as repetitive.

younesbelkada · 2023-12-21

Okay, done!

stevhliu

LGTM, thanks!

amyeroberts

Great - thanks for adding!

* add 4-bit serialization docs * up * up

add 4-bit serialization docs

bb54e64

younesbelkada changed the title ~~[add 4-bit serialization docs~~ [Docs] Add 4-bit serialization docs Dec 21, 2023

younesbelkada requested a review from stevhliu December 21, 2023 13:30

younesbelkada assigned amyeroberts and unassigned amyeroberts Dec 21, 2023

younesbelkada requested a review from amyeroberts December 21, 2023 13:31

stevhliu approved these changes Dec 21, 2023

up

33e5b00

younesbelkada requested a review from stevhliu December 21, 2023 16:10

up

0980f27

stevhliu approved these changes Dec 21, 2023

amyeroberts approved these changes Dec 21, 2023

younesbelkada merged commit 3a8769f into huggingface:main Dec 22, 2023
8 checks passed

younesbelkada deleted the add-4bit-ser-docs branch December 22, 2023 09:18

staghado pushed a commit to staghado/transformers that referenced this pull request Jan 15, 2024

[Docs] Add 4-bit serialization docs (huggingface#28182)

76ecbf5

* add 4-bit serialization docs * up * up
