Add LoRA to specific layers #427
Hi @LiJunnan1992! You can already do this by passing the fully qualified module names to `target_modules`:

```python
from peft import get_peft_model, LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m").to(0)
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    target_modules=[
        'model.decoder.layers.0.self_attn.v_proj',
        'model.decoder.layers.1.self_attn.v_proj',
        'model.decoder.layers.2.self_attn.v_proj',
    ],
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
# trainable params: 98304 || all params: 331294720 || trainable%: 0.029672673322412142
```

For comparison, the default configuration applies LoRA across all layers:

```python
model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m").to(0)
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
# trainable params: 1572864 || all params: 332769280 || trainable%: 0.472659014678278
```

However, this requires users to manually feed the full names of the modules. Does what you had in mind correspond to explicitly giving the numbers of the layers that you want to "ignore" for the LoRA transformation? cc @pacman100
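If you go the `target_modules` route, the full names don't have to be typed by hand. Here is a minimal sketch (assuming the OPT naming scheme `model.decoder.layers.{i}.self_attn.v_proj` shown above; other architectures use different module paths) that builds the list from layer indices:

```python
# Build the fully qualified v_proj names for the layers we want to adapt.
# The index list and the naming pattern are illustrative assumptions.
lora_layer_indices = [0, 1, 2]
target_modules = [
    f"model.decoder.layers.{i}.self_attn.v_proj" for i in lora_layer_indices
]

config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    target_modules=target_modules,
)
```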
Thanks for the reply. The current solution already works for me!
Thanks for the suggestion! This is now supported through the `layers_to_transform` argument, which lets you give the layer indices directly:

```python
from peft import get_peft_model, LoraConfig
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m").to(0)
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    layers_to_transform=[0, 1, 2],
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
```
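As a usage note, `layers_to_transform` can also be combined with a short `target_modules` list so that only the named projections inside the chosen layers are adapted. A minimal sketch (the `q_proj`/`v_proj` choice here is only an illustrative assumption, reusing the same OPT model as above):

```python
# Sketch: restrict LoRA to the q_proj/v_proj projections of layers 0-2 only.
# Short target_modules entries are matched against the end of each module name,
# while layers_to_transform filters by layer index.
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    target_modules=["q_proj", "v_proj"],
    layers_to_transform=[0, 1, 2],
)
model = get_peft_model(
    AutoModelForCausalLM.from_pretrained("facebook/opt-350m").to(0), config
)
model.print_trainable_parameters()
```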
Thanks for the great library!
It could be quite useful for many applications to support specifying the layers in which to insert the adapters. For example, completely freezing some of the earlier layers could save a large amount of computation, since fewer layers need back-propagation.
Is there any plan to support this? Or do you have any advice on where I should make changes if I want to implement this myself?
Thank you!
Junnan
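Whichever of the configurations shown above is used, a quick sanity check (a minimal sketch, assuming the model has already been wrapped with `get_peft_model` as in the earlier snippets) is to list the parameters that still require gradients; only the LoRA weights of the selected layers should appear:

```python
# List trainable parameters after wrapping the model with get_peft_model.
# Expect only lora_A / lora_B weights belonging to the chosen layers.
for name, param in model.named_parameters():
    if param.requires_grad:
        print(name)
```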