[Low-level-API] Add docs about LLAPI #836
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thanks for adding the docs and tests. They're very clear, I only have minor comments. Please take a look.
README.md (Outdated)
@@ -355,6 +355,42 @@ any GPU memory savings. Please refer issue [[FSDP] FSDP with CPU offload consume

2. When using ZeRO3 with zero3_init_flag=True, if you find that GPU memory increases with training steps, you might need to update DeepSpeed after [deepspeed commit 42858a9891422abc](https://github.com/microsoft/DeepSpeed/commit/42858a9891422abcecaa12c1bd432d28d33eb0d4). The related issue is [[BUG] Peft Training with Zero.Init() and Zero3 will increase GPU memory every forward step](https://github.com/microsoft/DeepSpeed/issues/3002)

## 🤗 PEFT as a utility library

Inject trainable adapters on any `torch` model using the `inject_adapter_in_model` method:
Not sure exactly about the wording, but I think it's worth highlighting here that calling this function will only inject the adapters but make no further changes to the model. Otherwise, users may be confused why they should use this and not `get_peft_model`.
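For context, here's a minimal sketch of that distinction; the `DummyModel` and `target_modules` below are illustrative placeholders, not copied from the PR:

```python
import torch
from peft import LoraConfig, get_peft_model, inject_adapter_in_model


class DummyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(10, 10)

    def forward(self, x):
        return self.linear(x)


lora_config = LoraConfig(target_modules=["linear"])

# inject_adapter_in_model only swaps the targeted submodules for their
# LoRA counterparts; what comes back is still a plain torch.nn.Module.
model = inject_adapter_in_model(lora_config, DummyModel(), adapter_name="default")

# get_peft_model additionally wraps the model in a PeftModel, which is what
# provides utilities such as save_pretrained, disable_adapter, merging, etc.
peft_model = get_peft_model(DummyModel(), lora_config)
```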
README.md (Outdated)
)

model = DummyModel()
model = inject_adapter_in_model(lora_config, model, "default")
Hmm, I wonder now why we did not have `adapter_name="default"` as a default argument? I think it would help here. If we don't want it, at least I would pass it as a keyword argument in this example, not positionally, to make it clear what the meaning of `"default"` is.

If we change the function to make the argument a default argument (ugh, so confusing, default vs `"default"`), the docs below also need to change a little bit ("that takes 3 arguments ...").
Sounds great, will address this change
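For illustration, the keyword-argument form suggested above would read like this (reusing `lora_config` and `model` from the README snippet):

```python
# Passing adapter_name as a keyword makes the meaning of "default" explicit.
model = inject_adapter_in_model(lora_config, model, adapter_name="default")
```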
import torch
from peft import LoraConfig, get_peft_model_state_dict, inject_adapter_in_model


class DummyModel(torch.nn.Module):
Great that you added tests.

I think the tests could also be added to `test_custom_models.py`. The disadvantage of that change would be that as is, the tests are very clear and straightforward. The advantage would be that the tests in `test_custom_models.py` can be easily parametrized with different custom modules and configs, so the test coverage is better.

I would be okay if you want to keep it as is, it's just a suggestion.
I see, ok, thanks for explaining! I would say let's keep it as it is, since the test is aimed at testing only the tiny snippet from the README and the docs, so I want to keep it very simple and minimal for now.
- Works for any torch module, and any modality (vision, text, multi-modal)

Cons:
- You need to manually writing saving and loading utility methods
Suggested change:
- You need to manually writing saving and loading utility methods
+ You need to manually write saving and loading utility methods
I think this could be confusing. I guess what you mean is stuff like `from_pretrained` etc. But people can still use the normal `torch.save` and `torch.load` that they already know. Saying they have to "manually" write the methods probably sounds worse than it actually is.
Hmm correct, let me rephrase this a bit then
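As a sketch of that rephrasing, saving and loading without `PeftModel` could rely on the plain `torch` utilities mentioned above; the file name here is arbitrary, and `model` is the injected model from the snippet:

```python
import torch
from peft import get_peft_model_state_dict, set_peft_model_state_dict

# Extract only the adapter weights from the injected model and save them
# with the torch serialization functions users already know.
adapter_state_dict = get_peft_model_state_dict(model)
torch.save(adapter_state_dict, "adapter_weights.pt")

# Later: after injecting adapters into a fresh model, restore the weights.
loaded = torch.load("adapter_weights.pt")
set_peft_model_state_dict(model, loaded)
```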
Cons:
- You need to manually writing saving and loading utility methods
- You cannot use any of the utility method provided by `PeftModel` such as disabling adapters, merging adapters, etc.
Maybe add a link to this section: https://huggingface.co/docs/peft/conceptual_guides/lora#utils-for-lora

Also, I took a look at some of the methods that are currently used for merging, unloading etc. and I think that with only a few changes, we can make them standalone functions (like `inject_adapter_in_model`) that don't require a `PeftModel` / `LoraModel` etc. At least for LoRA that should work.
Sounds great, we can do that in a follow up PR!
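For reference, the `PeftModel` utilities that the cons refer to look roughly like this; `TinyModel` is a hypothetical placeholder, not a model from the PR:

```python
import torch
from peft import LoraConfig, get_peft_model


class TinyModel(torch.nn.Module):  # placeholder model for illustration
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(10, 10)

    def forward(self, x):
        return self.linear(x)


# These conveniences come from the PeftModel wrapper, which the low-level
# inject_adapter_in_model route deliberately skips.
peft_model = get_peft_model(TinyModel(), LoraConfig(target_modules=["linear"]))

# Temporarily run the model without the adapters.
with peft_model.disable_adapter():
    print(peft_model(torch.randn(1, 10)))

# Merge the LoRA weights into the base weights and return the base model.
merged_model = peft_model.merge_and_unload()
```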
Thanks for the extensive review @BenjaminBossan! I should have addressed all the comments by now.
Fantastic, thanks for making the changes.
What does this PR do?
As discussed internally, this PR adds documentation and simple tests for the recent low-level API from #749
cc @BenjaminBossan @pacman100