
FEAT: Make safe serialization the default one #1088

Merged · 5 commits · Nov 15, 2023

Conversation

younesbelkada (Contributor, Author)

What does this PR do?

As per the title, and in line with current efforts in the OSS ecosystem, let's make safe serialization the default behaviour in PEFT as well.

Adapted the tests and added some regression tests!

cc @BenjaminBossan @pacman100
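For context, a minimal sketch of what the new default means for users (model names and output paths below are placeholders; file names assume PEFT's standard adapter naming):

```python
# Minimal sketch, assuming a PEFT version with this change; the model name
# and output directories are placeholders.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")
model = get_peft_model(base, LoraConfig(task_type="CAUSAL_LM"))

# New default: the adapter weights are written as adapter_model.safetensors
model.save_pretrained("outputs/adapter-safetensors")

# Opting out keeps the previous pickle-based format (adapter_model.bin)
model.save_pretrained("outputs/adapter-pickle", safe_serialization=False)
```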


# check if `adapter_config.json` is present
self.assertTrue(os.path.exists(os.path.join(tmp_dirname, "adapter_config.json")))

# check if `pytorch_model.bin` is not present
self.assertFalse(os.path.exists(os.path.join(tmp_dirname, "pytorch_model.bin")))
# check if `model.safetensors` is not present
self.assertFalse(os.path.exists(os.path.join(tmp_dirname, "model.safetensors")))
younesbelkada (Contributor, Author)

This change is necessary because transformers no longer saves `pytorch_model.bin` but the safetensors model instead.
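For reference, this is roughly what the transformers side looks like (a sketch assuming a recent transformers release where safetensors is already the default):

```python
# Sketch: recent transformers versions write model.safetensors by default;
# output directories are placeholders.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")
model.save_pretrained("outputs/base")                                   # writes model.safetensors
model.save_pretrained("outputs/base-pickle", safe_serialization=False)  # writes pytorch_model.bin
```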

HuggingFaceDocBuilderDev commented Nov 7, 2023

The documentation is not available anymore as the PR was closed or merged.

BenjaminBossan (Member) left a comment

No full review yet, but some general questions:

  1. Do we want to do a deprecation/future warning cycle before making safetensors the default? How was it done in transformers?
  2. Do we want to run _test_save_pretrained twice, once with safetensors, once with pytorch?

younesbelkada (Contributor, Author)

Thanks @BenjaminBossan!

  1. Not sure how they did it in transformers, let me double-check that.

> 2. Do we want to run _test_save_pretrained twice, once with safetensors, once with pytorch?

Yes, this is what I did in the PR: I kept test_save_pretrained as is, replaced the expected file names with the safetensors file names, and added a new test that checks that calling save_pretrained with safe_serialization=False leads to the same behaviour as before this PR.
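Roughly, the added check looks something like this (a simplified, hypothetical sketch, not the exact test code):

```python
# Hypothetical sketch of the safe_serialization=False check (simplified);
# the helper name is illustrative.
import os
import tempfile


def check_pickle_serialization(testcase, model):
    with tempfile.TemporaryDirectory() as tmp_dirname:
        model.save_pretrained(tmp_dirname, safe_serialization=False)

        # pre-PR behaviour: pickle-based adapter weights are written
        testcase.assertTrue(os.path.exists(os.path.join(tmp_dirname, "adapter_model.bin")))
        testcase.assertFalse(os.path.exists(os.path.join(tmp_dirname, "adapter_model.safetensors")))
```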

BenjaminBossan (Member)

> I kept test_save_pretrained as is, replaced the expected file names with the safetensors file names, and added a new test that checks that calling save_pretrained with safe_serialization=False leads to the same behaviour as before this PR.

Oh, I see. I wonder whether _test_save_pretrained could be changed slightly so that it takes an argument like use_safetensors that can be parameterized. Then the same test could be re-used without duplicating it wholesale.
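Something along these lines, for example (a hypothetical sketch; the argument name and scaffolding are illustrative, not the actual PEFT test suite):

```python
# Hypothetical parameterization sketch; attribute names are illustrative.
from parameterized import parameterized


class SavePretrainedTesterMixin:
    @parameterized.expand([(True,), (False,)])
    def test_save_pretrained(self, use_safetensors):
        self._test_save_pretrained(
            self.model_id,
            self.config_cls,
            self.config_kwargs,
            safe_serialization=use_safetensors,
        )
```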

I think the new test is not quite a regression test, as regression testing would require the model to be persisted with one PEFT version and loaded with another. We should really try to advance #995, which adds regression tests that would also help cover this case.

@@ -287,7 +287,10 @@ def _test_save_pretrained(self, model_id, config_cls, config_kwargs):
    model = model.to(self.torch_device)

    with tempfile.TemporaryDirectory() as tmp_dirname:
        model.save_pretrained(tmp_dirname)
        if safe_serialization:
younesbelkada (Contributor, Author) commented Nov 7, 2023

I prefer not to pass safe_serialization=safe_serialization to save_pretrained here, so that the native behaviour gets tested; let me know if this makes sense.

BenjaminBossan (Member)

You mean the default behavior? I think it's okay either way.

younesbelkada (Contributor, Author)

Yes, sorry, I meant the default behaviour.
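In other words, the test ends up exercising something like the following (an illustrative sketch, not the exact diff):

```python
# Illustrative sketch of the distinction discussed above (not the exact diff).
def save_for_test(model, tmp_dirname, safe_serialization):
    if safe_serialization:
        # exercise the new default rather than passing safe_serialization=True explicitly
        model.save_pretrained(tmp_dirname)
    else:
        model.save_pretrained(tmp_dirname, safe_serialization=False)
```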

younesbelkada (Contributor, Author) commented Nov 9, 2023

We are past the patch release now and this branch has been merged with main; this PR is ready for another review!

HuggingFaceDocBuilderDev commented Nov 9, 2023

The documentation is not available anymore as the PR was closed or merged.

pacman100 (Contributor) left a comment

Thank you @younesbelkada for making safetensors the default format for saving the adapter weights 🔐! This makes saving safer by avoiding the arbitrary code execution risk that comes with the pickle format.

BenjaminBossan (Member) left a comment

Thanks for making the switch and adjusting the tests. I think we can merge now, although I imagine some users will be surprised that their code suddenly produces safetensors files. We should put a prominent note about this in the release notes when we publish the next version.


4 participants