
The Implementation of AdaLoRA (ICLR 2023) #233

Merged
30 commits merged into huggingface:main on Apr 6, 2023

Conversation

@QingruZhang (Contributor):

Dear PEFT maintainers,
This is Qingru Zhang, the author of "Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning" (ICLR 2023, please see the link). We would like to submit this PR to integrate AdaLoRA into PEFT. It was a great discussion with Sourab about the implementation of AdaLoRA and its integration into PEFT. Thanks a lot for Sourab's comments and support while we prepared this PR. It would be great to have AdaLoRA available in PEFT! Please let me know in case of any questions about the implementation.

Thanks,
Qingru
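
For reference, a minimal sketch of wrapping a model with AdaLoRA through the PEFT API this PR adds. The base model, target modules, and hyperparameter values below are illustrative placeholders, not settings taken from this PR:

```python
# Minimal sketch: wrapping a model with AdaLoRA via PEFT.
# The base model, target_modules, and hyperparameter values are illustrative only.
from transformers import AutoModelForSeq2SeqLM
from peft import AdaLoraConfig, get_peft_model

base_model = AutoModelForSeq2SeqLM.from_pretrained("t5-base")

config = AdaLoraConfig(
    init_r=12,                   # initial rank of each incremental matrix (initial budget)
    target_r=8,                  # average target rank after budget allocation (final budget)
    tinit=200,                   # warmup steps before rank pruning starts
    tfinal=1000,                 # final fine-tuning steps after the budget schedule ends
    deltaT=10,                   # steps between budget reallocations
    total_step=3000,             # total training steps, used by the budget scheduler
    lora_alpha=32,
    lora_dropout=0.1,
    target_modules=["q", "v"],   # query/value projections in T5 (example)
    task_type="SEQ_2_SEQ_LM",
)

model = get_peft_model(base_model, config)
model.print_trainable_parameters()
```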

@pacman100 (Contributor) left a comment:

This is awesome 🔥. Well done @QingruZhang, and thank you for making AdaLoRA easy to use for the community 🤗. LGTM!

Left a few comments and suggestions. Could you also run make style and make quality to fix the code-quality CI?

Resolved review threads: src/peft/tuners/adalora.py (6 threads, 5 now outdated) and src/peft/mapping.py (1 thread, now outdated).
QingruZhang and others added 7 commits on April 5, 2023 at 16:23.
Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>
@HuggingFaceDocBuilderDev commented on Apr 5, 2023:

The documentation is not available anymore as the PR was closed or merged.

@QingruZhang requested a review from @pacman100 on April 5, 2023 at 21:51.
@pacman100 (Contributor) commented on Apr 6, 2023:

Hello @QingruZhang, I applied AdaLoRA to Whisper-large fine-tuning; here is the wandb run:

  1. There is an improvement in normalized WER (2.4% relative improvement, matching the fully fine-tuned model up to the first decimal place) in comparison to LoRA. However, there is no improvement in WER.
  2. Interestingly, it preserved a lot more trainable params in the encoder than in the decoder.
  3. In the decoder, fc1 was the most important target layer for adding the low-rank matrices.
  4. Final trainable params after the budget-aware AdaLoRA tuning: trainable params: 15520256 || all params: 1558825701 || trainable%: 0.9956376771337311. For LoRA it is trainable params: 15728640 || all params: 1559033600 || trainable%: 1.0088711365810203. So, slightly fewer trainable params than LoRA after the budget-aware pruning (see the training-loop sketch below).
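
The budget-aware pruning referenced in point 4 happens during training via the rank allocator. A minimal sketch of the AdaLoRA training step, continuing from a PEFT-wrapped `model` as in the earlier sketch; `dataloader`, `optimizer`, and `num_epochs` are placeholders for the user's own setup:

```python
# Minimal sketch of an AdaLoRA training step with budget-aware rank allocation.
# `model` is a PEFT-wrapped model (get_peft_model with an AdaLoraConfig);
# `dataloader`, `optimizer`, and `num_epochs` are placeholders.
global_step = 0
for epoch in range(num_epochs):
    for batch in dataloader:
        outputs = model(**batch)
        loss = outputs.loss      # in PEFT, AdaLoraModel adds the orthogonal regularization to the loss internally
        loss.backward()
        optimizer.step()
        # Reallocate the rank budget (prune unimportant singular values) according to the schedule.
        model.base_model.update_and_allocate(global_step)
        optimizer.zero_grad()
        global_step += 1

# The "trainable params: ... || all params: ... || trainable%: ..." lines in point 4
# are the output format of PEFT's print_trainable_parameters helper:
model.print_trainable_parameters()
```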

[Screenshot attached: 2023-04-06, 11:07 AM]

@pacman100 (Contributor) left a comment:

Thank you @QingruZhang for iterating, LGTM! 🤗

@pacman100 merged commit a7d5e51 into huggingface:main on Apr 6, 2023.
@QingruZhang (Contributor, Author) commented:

Hello @pacman100, thanks for merging the commits and running the tests for AdaLoRA! Typically, we should set the initial budget to 1.5 times the final target budget and tune the budget schedule so there are enough final fine-tuning steps to get good performance. Please let me know if there are more experimental tests I need to run. Thanks again for your help during this process!
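
As a concrete illustration of that recommendation, a hedged sketch of an AdaLoraConfig budget schedule; the specific step counts are placeholders, and only the 1.5x ratio and the final fine-tuning window come from the comment above:

```python
# Illustrative budget schedule following the 1.5x recommendation above.
# The step counts are placeholders; only the ratio and the schedule structure
# come from the comment.
from peft import AdaLoraConfig

total_step = 10_000               # total optimizer steps planned for the run
target_r = 8                      # final target budget (average rank)
init_r = int(1.5 * target_r)      # initial budget ~1.5x the final target budget

config = AdaLoraConfig(
    init_r=init_r,
    target_r=target_r,
    tinit=500,                    # warmup steps before pruning begins
    tfinal=3_000,                 # leave enough final fine-tuning steps after the budget is fixed
    deltaT=10,                    # steps between budget reallocations
    total_step=total_step,
)
```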

@chenweizhu commented on Apr 26, 2023:

Hi @pacman100, did you also measure the peak GPU memory consumption, training time, and other metrics?

It would also be interesting to compare all these metrics, including quality, when the budget is set to 1.5x or 2x.
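
For anyone who wants to collect those numbers, a minimal sketch using PyTorch's built-in counters; `train()` below is only a placeholder for the fine-tuning loop being measured:

```python
# Minimal sketch: record wall-clock time and peak GPU memory around a training run.
import time
import torch

def train():
    pass  # placeholder for the actual fine-tuning loop

torch.cuda.reset_peak_memory_stats()
start = time.time()

train()

elapsed = time.time() - start
peak_mem_gib = torch.cuda.max_memory_allocated() / 1024**3
print(f"training time: {elapsed:.1f}s, peak GPU memory: {peak_mem_gib:.2f} GiB")
```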

@chenweizhu commented on Apr 26, 2023:

And how about running some tests on the Llama 7B or 13B models?
