
[Core] Change 8-bit serialization weight format format #1164

Merged: 10 commits from fix-8bit-serialization into main on Apr 10, 2024

Conversation

younesbelkada (Collaborator) commented Apr 4, 2024

Currently, for 8-bit layers, the weight format is saved as a plain str in the state dict, which is no longer supported in transformers.

This PR should be fully backward compatible with previous 8-bit weights pushed to the Hub.
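For context, a minimal sketch of the direction, assuming the str-to-int mapping LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING used in the diff below (the specific format names and codes here are illustrative, and serialize_weight_format is a hypothetical helper, not the PR's code):

import torch

# Illustrative mapping (assumed values): format name (str) -> int code
LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING = {"row": 0, "col32": 1, "col_turing": 2, "col_ampere": 3}

def serialize_weight_format(weight_format: str) -> torch.Tensor:
    # Store the format as an int tensor rather than a raw str,
    # since str entries in the state dict are no longer supported in transformers.
    return torch.tensor(LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING[weight_format])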

cc @Titus-von-Koeller @TimDettmers @SunMarc

github-actions bot commented Apr 4, 2024

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Titus-von-Koeller (Collaborator)

@younesbelkada Thanks a lot for your work on this ❤️

I'll hold off on my final review and merge until you ping me, as I remember you saying this is potentially still in flux.

younesbelkada (Collaborator, Author)

Thanks @Titus-von-Koeller! Yes, let me run a few other tests and then ping you here.

bitsandbytes/nn/modules.py (review thread, outdated, resolved)
bitsandbytes/utils.py (review thread, resolved)
bitsandbytes/nn/modules.py (review thread, outdated, resolved)
Comment on lines 622 to 633
if isinstance(weight_format, torch.Tensor):
weight_format = weight_format.item()

    # For the new weight format storage type we explicitly check
    # if weight_format is in the mapping
if isinstance(weight_format, int) and weight_format not in LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.values():
raise ValueError(f"Expected supported weight format - got {weight_format}")
elif isinstance(weight_format, int) and weight_format in LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.values():
weight_format = dict(
zip(LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.values(), LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.keys())
)[weight_format]

Contributor


622..633 looks like it should be a free function (determine_weight_format() or similar).
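A minimal sketch of what that helper could look like, pulled together from the snippet above (determine_weight_format is the reviewer's suggested name; the inverse mapping is derived from LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING):

import torch

# int code -> format name, inverted from LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING
INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING = {
    v: k for k, v in LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.items()
}

def determine_weight_format(weight_format):
    # Normalize a serialized weight format (tensor or int) back to its str name.
    if isinstance(weight_format, torch.Tensor):
        weight_format = weight_format.item()
    if isinstance(weight_format, int):
        if weight_format not in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
            raise ValueError(f"Expected supported weight format - got {weight_format}")
        weight_format = INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING[weight_format]
    return weight_format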

akx (Contributor) commented Apr 9, 2024

There should probably be at least some tests for this in this repo too.
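For illustration, a simple round-trip test along those lines might look like this (test names and structure are assumptions building on the helpers sketched earlier, not tests from the PR):

import pytest
import torch

@pytest.mark.parametrize("name,code", sorted(LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING.items()))
def test_weight_format_round_trip(name, code):
    # A format serialized as an int tensor should deserialize back to its str name.
    assert determine_weight_format(torch.tensor(code)) == name

def test_unknown_weight_format_raises():
    # Unknown int codes should be rejected rather than silently passed through.
    with pytest.raises(ValueError):
        determine_weight_format(999)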

Titus-von-Koeller (Collaborator)

Thanks @akx for the review! Good catches. I agree that these changes would make sense.

@younesbelkada In case you're too busy tomorrow, I can wrap this up for you and come up with some simple tests. Either way, we can release this by Thursday, no problem, so that we're aligned with the Transformers release.

For now, I'll focus on some multi-backend topics instead, as they're critical for being well prepared for tomorrow evening's meeting on that subject.

younesbelkada and others added 6 commits on April 10, 2024, several co-authored by Aarni Koskela <akx@iki.fi>.
Titus-von-Koeller merged commit 7449d71 into main on Apr 10, 2024
35 checks passed
Titus-von-Koeller deleted the fix-8bit-serialization branch on April 10, 2024 at 08:50
Comment on lines +629 to +634
# For the new weight format storage type, we explicitly check
# if weight_format is in the mapping
if isinstance(weight_format, int) and weight_format not in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
raise ValueError(f"Expected supported weight format - got {weight_format}")
elif isinstance(weight_format, int) and weight_format in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
weight_format = INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING[weight_format]
Contributor


This doesn't seem to make much sense? Why is the isinstance() check repeated?
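For illustration, the two branches could collapse into a single isinstance() check (a sketch, not the merged code):

if isinstance(weight_format, int):
    if weight_format not in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
        raise ValueError(f"Expected supported weight format - got {weight_format}")
    weight_format = INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING[weight_format]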

Comment on lines +626 to +635
if isinstance(weight_format, torch.Tensor):
weight_format = weight_format.item()

# For the new weight format storage type, we explicitly check
# if weight_format is in the mapping
if isinstance(weight_format, int) and weight_format not in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
raise ValueError(f"Expected supported weight format - got {weight_format}")
elif isinstance(weight_format, int) and weight_format in INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING:
weight_format = INVERSE_LINEAR_8BIT_WEIGHTS_FORMAT_MAPPING[weight_format]

Contributor


As said before, this should probably be a free-standing helper function.

naqi pushed a commit to naqi/bitsandbytes that referenced this pull request Apr 16, 2024
…s-foundation#1164)

* change 8-bit serialization weight format format

* precimmit

* pre-commit

* fix

* Update bitsandbytes/nn/modules.py

Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/nn/modules.py

Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/utils.py

Co-authored-by: Aarni Koskela <akx@iki.fi>

* address feedback

* lint

---------

Co-authored-by: Aarni Koskela <akx@iki.fi>
matthewdouglas pushed a commit to matthewdouglas/bitsandbytes that referenced this pull request Oct 28, 2024
…s-foundation#1164)
