[`core` / `PEFT` / `LoRA`] Integrate PEFT into Unet #5151

younesbelkada · 2023-09-22T16:48:26Z

What does this PR do?

Second (actually third) step of PEFT integration into diffusers, this time PEFT is integrated on the Unet

The testing script is still the same, works with and without scale as expected

from diffusers import StableDiffusionPipeline
import os
import torch

path = "runwayml/stable-diffusion-v1-5"
lora_id = "takuma104/lora-test-text-encoder-lora-target"

pipe = StableDiffusionPipeline.from_pretrained(path, torch_dtype=torch.float16)
pipe.load_lora_weights(lora_id)
pipe = pipe.to("cuda")

prompt = "a red sks dog"

images = pipe(
    prompt=prompt, 
    num_inference_steps=15, 
    cross_attention_kwargs={"scale": 0.5},
    generator=torch.manual_seed(0)
).images


for i, image in enumerate(images):
    file_name = f"aa_{i}"
    os.makedirs("images-integration", exist_ok=True)
    path = os.path.join("images-integration", f"{file_name}-new-peft-2.png")
    image.save(path)

and this should return:

TODOs:

deal with failing tests
add tests
think of a better way to deal with scaling / unscaling (decorator, ..?) inside call
add fuse-unfuse tests for unet + TE
add multi-adapter tests for text encoder
add multi-adapter tests for text encoder + unet
test it with Kohya checkpoints
add deprecation messages inside LoRACompatiblexxx ?

cc @sayakpaul @patrickvonplaten @pacman100 @BenjaminBossan

src/diffusers/models/modeling_utils.py

HuggingFaceDocBuilderDev · 2023-09-25T16:46:50Z

The documentation is not available anymore as the PR was closed or merged.

src/diffusers/utils/peft_utils.py

patrickvonplaten · 2023-09-26T11:59:59Z

@younesbelkada let me know if you'd like me to do a review here

younesbelkada

Thanks @patrickvonplaten indeed I have a question here, since the changes made in pipeline_xxx needs to be copied over all the pipeline files I was wondering which approach looks best. @BenjaminBossan suggested offline that the second approach is better - which I also agree - let me know what do you think!

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py

sayakpaul · 2023-09-26T12:07:08Z

I slightly prefer the second approach as I am not a big fan of using context managers for these kinds of use cases. I think it's better to be a bit verbose here.

src/diffusers/models/lora.py

tests/lora/test_lora_layers_peft.py

younesbelkada · 2023-10-12T16:26:41Z

Thanks for all the reviews!
Regarding the test I have modified, if huggingface/peft#1017 gets merged, the test will pass on CPU (the test are used to ran on CPU). Since that test used to pass on CPU I think that usecase should be supported to avoid potential regressions.

The reason it used to pass is simply because with the old backend lora weights are always upcasted in fp32: https://github.com/huggingface/diffusers/blob/main/src/diffusers/models/lora.py#L257
Otherwise torch.bmm would fail:

>>> import torch
>>> t1 = torch.randn(1, 1).to(torch.float16)
>>> t2 = torch.randn(1, 1).to(torch.float16)
>>> torch.bmm(t1[None, :], t2[None, :])
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
RuntimeError: "bmm" not implemented for 'Half'

BenjaminBossan

Thanks, this looks really good from my POV. I still left a few comments but those can be safely ignored. I also still think that get_list_adapters could be a confusing name to users, maybe you can find something better.

src/diffusers/loaders.py

BenjaminBossan · 2023-10-13T10:37:53Z

src/diffusers/loaders.py

+                        if not is_model_cpu_offload:
+                            is_model_cpu_offload = isinstance(component._hf_hook, CpuOffload)
+                        if not is_sequential_cpu_offload:
+                            is_sequential_cpu_offload = isinstance(component._hf_hook, AlignDevicesHook)


This would have the same effect in two lines, but your version is probably easier to understand, just wanted to throw this out there.

Suggested change

if not is_model_cpu_offload:

is_model_cpu_offload = isinstance(component._hf_hook, CpuOffload)

if not is_sequential_cpu_offload:

is_sequential_cpu_offload = isinstance(component._hf_hook, AlignDevicesHook)

is_model_cpu_offload |= isinstance(component._hf_hook, CpuOffload)

is_sequential_cpu_offload |= isinstance(component._hf_hook, AlignDevicesHook)

Would prefer the current implementation as is in favor of easier readability.

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

sayakpaul

Looks good to me, barring the pending comments. Thanks so much for iterating!

tests/lora/test_lora_layers_old_backend.py

patrickvonplaten · 2023-10-13T14:46:46Z

Let's merge the PR and make sure we monitor the fast & slow test here

patrickvonplaten · 2023-10-13T14:46:59Z

Great job @younesbelkada and everybody involved here!

younesbelkada · 2023-10-13T15:19:03Z

Nice, thanks!
The slow CUDA lora old backend tests seem to pass: https://github.com/huggingface/diffusers/actions/runs/6509717155/job/17681829144 however the slow lora new backend are not run I think. I am happy to add them in the workflow, do you want me to open a PR for it?

patrickvonplaten · 2023-10-16T12:56:33Z

Nice, thanks! The slow CUDA lora old backend tests seem to pass: https://github.com/huggingface/diffusers/actions/runs/6509717155/job/17681829144 however the slow lora new backend are not run I think. I am happy to add them in the workflow, do you want me to open a PR for it?

Yes this would be great cc @DN6 here

* v1 * add tests and fix previous failing tests * fix CI * add tests + v1 `PeftLayerScaler` * style * add scale retrieving mechanism system * fix CI * up * up * simple approach --> not same results for some reason * fix issues * fix copies * remove unneeded method * active adapters! * fix merge conflicts * up * up * kohya - test-1 * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix scale * fix copies * add comment * multi adapters * fix tests * oops * v1 faster loading - in progress * Revert "v1 faster loading - in progress" This reverts commit ac925f8. * kohya same generation * fix some slow tests * peft integration features for unet lora 1. Support for Multiple ranks/alphas 2. Support for Multiple active adapters 3. Support for enabling/disabling LoRAs * fix `get_peft_kwargs` * Update loaders.py * add some tests * add unfuse tests * fix tests * up * add set adapter from sourab and tests * fix multi adapter tests * style & quality * style * remove comment * fix `adapter_name` issues * fix unet adapter name for sdxl * fix enabling/disabling adapters * fix fuse / unfuse unet * nit * fix * up * fix cpu offloading * fix another slow test * fix another offload test * add more tests * all slow tests pass * style * fix alpha pattern for unet and text encoder * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Update src/diffusers/models/attention.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * up * up * clarify comment * comments * change comment order * change comment order * stylr & quality * Update tests/lora/test_lora_layers_peft.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix bugs and add tests * Update src/diffusers/models/modeling_utils.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Update src/diffusers/models/modeling_utils.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * refactor * suggestion * add break statemebt * add compile tests * move slow tests to peft tests as I modified them * quality * refactor a bit * style * change import * style * fix CI * refactor slow tests one last time * style * oops * oops * oops * final tweak tests * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * comments * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * remove comments * more comments * try * revert * add `safe_merge` tests * add comment * style, comments and run tests in fp16 * add warnings * fix doc test * replace with `adapter_weights` * add `get_active_adapters()` * expose `get_list_adapters` method * better error message * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style * trigger slow lora tests * fix tests * maybe fix last test * revert * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * move `MIN_PEFT_VERSION` * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * let's not use class variable * fix few nits * change a bit offloading logic * check earlier * rm unneeded block * break long line * return empty list * change logic a bit and address comments * add typehint * remove parenthesis * fix * revert to fp16 in tests * add to gpu * revert to old test * style * Update src/diffusers/loaders.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * change indent * Apply suggestions from code review * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

younesbelkada added 2 commits September 22, 2023 16:45

v1

cf2c0ba

add tests and fix previous failing tests

8759f55

younesbelkada commented Sep 25, 2023

View reviewed changes

src/diffusers/models/modeling_utils.py Outdated Show resolved Hide resolved

younesbelkada and others added 5 commits September 25, 2023 15:42

fix CI

c90aedc

Merge remote-tracking branch 'upstream/main' into peft-part-2

0bfb136

add tests + v1 PeftLayerScaler

3002ea3

Merge branch 'main' into peft-part-2

d6f500c

style

64ca2bb

add scale retrieving mechanism system

f62e506

younesbelkada commented Sep 25, 2023

View reviewed changes

src/diffusers/utils/peft_utils.py Outdated Show resolved Hide resolved

younesbelkada added 3 commits September 25, 2023 17:01

fix CI

48842c0

Merge remote-tracking branch 'upstream/main' into peft-part-2

71d4990

up

1fb4aa2

younesbelkada commented Sep 26, 2023

View reviewed changes

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py Outdated Show resolved Hide resolved

src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py Outdated Show resolved Hide resolved

younesbelkada added 5 commits September 27, 2023 08:56

up

4c803f6

simple approach --> not same results for some reason

11a493a

fix issues

4ea8959

fix copies

16b1161

remove unneeded method

b3a02be

younesbelkada commented Sep 27, 2023

View reviewed changes

src/diffusers/models/lora.py Outdated Show resolved Hide resolved

younesbelkada commented Sep 27, 2023

View reviewed changes

tests/lora/test_lora_layers_peft.py Show resolved Hide resolved

active adapters!

cc135f2

younesbelkada mentioned this pull request Sep 27, 2023

PEFT Integration for Text Encoder to handle multiple alphas/ranks, disable/enable adapters and support for multiple adapters #5147

Merged

younesbelkada and others added 4 commits September 27, 2023 15:06

Merge branch 'main' into peft-part-2

5c493e5

fix merge conflicts

a09530c

up

d3ce092

up

9e500d2

fix

44f658d

younesbelkada mentioned this pull request Oct 12, 2023

FEAT: Add fp16 + cpu merge support huggingface/peft#1017

Merged

revert to fp16 in tests

7fd50a7

add to gpu

e92c6de

younesbelkada requested review from patrickvonplaten, BenjaminBossan and sayakpaul October 12, 2023 16:29

sayakpaul mentioned this pull request Oct 13, 2023

Use of LoRa sliders #5352

Closed

younesbelkada and others added 3 commits October 13, 2023 10:24

revert to old test

4e382ee

style

6ae767f

Merge branch 'main' into peft-part-2

a0f976e

BenjaminBossan approved these changes Oct 13, 2023

View reviewed changes

Update src/diffusers/loaders.py

b4e1381

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

sayakpaul approved these changes Oct 13, 2023

View reviewed changes

change indent

f708dba

patrickvonplaten reviewed Oct 13, 2023

View reviewed changes

tests/lora/test_lora_layers_old_backend.py Outdated Show resolved Hide resolved

patrickvonplaten reviewed Oct 13, 2023

View reviewed changes

tests/lora/test_lora_layers_old_backend.py Outdated Show resolved Hide resolved

Apply suggestions from code review

f17206c

patrickvonplaten reviewed Oct 13, 2023

View reviewed changes

tests/lora/test_lora_layers_old_backend.py Outdated Show resolved Hide resolved

patrickvonplaten reviewed Oct 13, 2023

View reviewed changes

tests/lora/test_lora_layers_old_backend.py Outdated Show resolved Hide resolved

Apply suggestions from code review

950d19c

patrickvonplaten merged commit 2bfa55f into huggingface:main Oct 13, 2023

younesbelkada deleted the peft-part-2 branch October 13, 2023 15:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`core` / `PEFT` / `LoRA`] Integrate PEFT into Unet #5151

[`core` / `PEFT` / `LoRA`] Integrate PEFT into Unet #5151

younesbelkada commented Sep 22, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 25, 2023 •

edited

Loading

patrickvonplaten commented Sep 26, 2023

younesbelkada left a comment

sayakpaul commented Sep 26, 2023

younesbelkada commented Oct 12, 2023 •

edited

Loading

BenjaminBossan left a comment

BenjaminBossan Oct 13, 2023

sayakpaul Oct 13, 2023

sayakpaul left a comment

patrickvonplaten commented Oct 13, 2023

patrickvonplaten commented Oct 13, 2023

younesbelkada commented Oct 13, 2023

patrickvonplaten commented Oct 16, 2023

[core / PEFT / LoRA] Integrate PEFT into Unet #5151

[core / PEFT / LoRA] Integrate PEFT into Unet #5151

Conversation

younesbelkada commented Sep 22, 2023 • edited Loading

What does this PR do?

TODOs:

HuggingFaceDocBuilderDev commented Sep 25, 2023 • edited Loading

patrickvonplaten commented Sep 26, 2023

younesbelkada left a comment

Choose a reason for hiding this comment

sayakpaul commented Sep 26, 2023

younesbelkada commented Oct 12, 2023 • edited Loading

BenjaminBossan left a comment

Choose a reason for hiding this comment

BenjaminBossan Oct 13, 2023

Choose a reason for hiding this comment

sayakpaul Oct 13, 2023

Choose a reason for hiding this comment

sayakpaul left a comment

Choose a reason for hiding this comment

patrickvonplaten commented Oct 13, 2023

patrickvonplaten commented Oct 13, 2023

younesbelkada commented Oct 13, 2023

patrickvonplaten commented Oct 16, 2023

[`core` / `PEFT` / `LoRA`] Integrate PEFT into Unet #5151

[`core` / `PEFT` / `LoRA`] Integrate PEFT into Unet #5151

younesbelkada commented Sep 22, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 25, 2023 •

edited

Loading

younesbelkada commented Oct 12, 2023 •

edited

Loading