Mixed adapters 2 #3

Open · wants to merge 15 commits into base: testing

Conversation

BenjaminBossan
Owner

No description provided.

BenjaminBossan and others added 15 commits November 21, 2023 18:20
This PR is a follow-up to, and supersedes, huggingface#1069. That PR is no longer
necessary, as the base layer refactor PR has been merged.

Description

This PR makes it possible to add adapters of different types, e.g. LoRA and LoHa:

from peft import LoraConfig, LoHaConfig, get_peft_model

base_model = ...
config0 = LoraConfig(...)
peft_model = get_peft_model(base_model, config0, mixed=True)
config1 = LoHaConfig(...)
peft_model.add_adapter(config1, "other")
peft_model.set_adapter(["default", "other"])
peft_model(x)

At this point, both adapters are active at the same time.

Existing code should not be affected by this change, since users need to
opt into this behavior by setting mixed=True, and a completely different
class (PeftMixedModel) is used.

Also of interest: this method can be used for a single adapter type but
with very different configs. Right now, we have limited support for that
(e.g. for LoRA, different r values via rank_pattern), but with this, we no
longer need to special-case the differing arguments.
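
For illustration, a hypothetical sketch (reusing base_model from the snippet above; the ranks and target_modules are placeholders, not part of this PR) of combining two LoRA adapters with very different configs:

config_a = LoraConfig(r=8, target_modules=["q_proj"])
config_b = LoraConfig(r=64, lora_alpha=128, target_modules=["v_proj"])
peft_model = get_peft_model(base_model, config_a, mixed=True)
peft_model.add_adapter(config_b, "large_rank")
peft_model.set_adapter(["default", "large_rank"])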

Not implemented

- [ ] I'm not yet sure if the same logic can be applied to IA³ or if it
  may fail because IA³ can apply its scaling to the input, not the output.
- [ ] It is currently not possible to represent a mixed adapter model as
  a single config. I think we can come up with a solution but I don't
  think it is necessary for a first version of this.
- [ ] Saving and loading is not yet implemented for mixed models.

Those could potentially be added in a future PR.
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
Note: Loading is still only supported for one adapter at a time. Enabling
loading of multiple adapters at once (or saving them, for that matter)
requires us to first define a config format that can handle multiple
adapters.
This PR adds the possibility to use different initialization methods for
LoRA, which is a requirement for a completely backwards-compatible adoption
of PEFT in diffusers.

The default is still the same as always, namely the one from the
reference implementation by Microsoft. On top of that, it is now
possible to pass `init_lora_weights='gaussian'` to initialize the LoRA
weights in the same way as the diffusers default, namely with a normal
distribution scaled by 1/r.

The init method currently applies to LoRA linear and conv layers, but
not embedding layers, which are always initialized from a normal
distribution (and are probably irrelevant for diffusers).

In the future, similar extensions could be added for other adapter
methods.
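
As an illustration, a minimal sketch of the new option (the model name and target_modules below are placeholders, not part of this PR):

from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model

# Default: reference initialization from the Microsoft implementation
default_config = LoraConfig(r=8, target_modules=["c_attn"])

# New: Gaussian initialization scaled by 1/r, matching the diffusers default
gaussian_config = LoraConfig(r=8, target_modules=["c_attn"], init_lora_weights="gaussian")

peft_model = get_peft_model(base_model, gaussian_config)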
…ers and tokenizer (huggingface#1147)

* add support for saving base layers weights along with adapter weights

* Update save_and_load.py

* Add an example showing the usage of the added feature

* refactor the functionality

* fix

* refactoring code

1. Add `is_embedding_layer_resized` parameter to `save_pretrained`
2. Fix the deduplication in README when adding PEFT details.
3. `save_pretrained` should only save the model when `is_main_process=True`, which is one of the parameters of `save_pretrained`.

* update example

* fix the model card

* fix model card

* 😅

* fix model card

* automate setting `is_embedding_layer_resized`

* nits

* Update peft_lora_clm_with_additional_tokens.ipynb

* add test

* fix tests

* maybe fixes the issue?

* address comments

Co-Authored-By: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

---------

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
---------

Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
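To illustrate the resized-embedding feature from huggingface#1147 above, a rough sketch of the intended workflow (the model name, new tokens, and output path are placeholders; per the commits, the embedding-resize flag is set automatically and is_main_process is a parameter of save_pretrained):

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Add new tokens and resize the base model's embedding layer
tokenizer.add_tokens(["<special-1>", "<special-2>"])
model.resize_token_embeddings(len(tokenizer))

peft_model = get_peft_model(model, LoraConfig(r=8, target_modules=["c_attn"]))

# Saves the adapter weights together with the resized embedding layers;
# only the main process writes to disk.
peft_model.save_pretrained("output_dir", is_main_process=True)
tokenizer.save_pretrained("output_dir")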
Adds an option to use Megatron's ColumnParallelLinear and RowParallelLinear
for LoRA linear layers, leading to improved performance when using LoRA
with Megatron.
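A hypothetical sketch of how this could be configured; the megatron_config/megatron_core argument names, the TransformerConfig import, and the target module names are assumptions, not confirmed by this commit:

from megatron.core.transformer.transformer_config import TransformerConfig  # requires megatron-core
from peft import LoraConfig

# Assumption: LoRA is told about the Megatron setup so that targeted
# ColumnParallelLinear/RowParallelLinear layers get Megatron-aware LoRA layers.
megatron_config = TransformerConfig(num_layers=24, hidden_size=1024, num_attention_heads=16)
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["query_key_value", "dense"],  # typical Megatron module names (assumption)
    megatron_config=megatron_config,
    megatron_core="megatron.core",
)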
* Support OFT

* add test

* Update README

* fix code quality

* fix test

* Skip 1 test

* fix eps rule and add more test

* feat: added examples to new OFT method

* fix: removed wrong arguments from model example

* fix: changed name of inference file

* fix: changed prompt variable

* fix docs

* fix: dreambooth inference revision based on feedback

* fix: review from BenjaminBossan

* apply safe merge

* del partially

* refactor oft

* refactor oft

* del unused line

* del unused line

* fix skip in windows

* skip test

* Add comments about where the bias is added

* rename orig_weights to new_weights

* use inverse instead of linalg.inv

* delete alpha and scaling

---------

Co-authored-by: Lukas Kuhn <lukaskuhn.lku@gmail.com>
Co-authored-by: Lukas Kuhn <lukas.kuhn@deutschebahn.com>
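For reference, a minimal hypothetical sketch of applying the new OFT method (the model name, r, and target_modules are placeholders, not part of these commits):

from transformers import AutoModelForCausalLM
from peft import OFTConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model
oft_config = OFTConfig(r=8, target_modules=["c_attn"])     # OFT applies block-wise orthogonal transforms
peft_model = get_peft_model(base_model, oft_config)
peft_model.print_trainable_parameters()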