LoraConfig conflict when using layers_to_transform in LlamaModel #2155
Comments
Thanks for reporting the issue. Indeed, the usage of layers_to_transform and layers_pattern […]. The idea here is that if we have a config like

```python
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj"],
    layers_to_transform=[0, 31],
    layers_pattern="layers",
    lora_dropout=0,
    bias="none",
)
```

[…] However, as you noted, using […]. The TODOs from this issue are: […]
For point 3, would you be interested in tackling this, @JINO-ROHIT, since you refactored that part in #2102?
@BenjaminBossan yep, I'll be happy to work on this
Addresses part of huggingface#2155. Also fix type annotations where appropriate.
Addresses part of huggingface#2155.

Description

So far, the layers_pattern argument would only work if there was a prefix to the pattern. As an example, if the module name is decoder.layer.0.attn.to_q and we pass layers_pattern="layer", this would match. However, if the module name was layer.0.attn.to_q, it would not work. Usually, when we create a model with AutoModelForFoo.from_pretrained, the "layer" part would never be first. However, if we load a model directly, e.g. through LlamaModel.from_pretrained, there is actually no prefix. As a consequence, we get no match there. With this PR, the prefix is made optional, so that the second pattern also matches.

Status

I'm not sure yet if this should be merged, as it is technically backwards incompatible. Users can still target the desired modules by carefully crafting a regex for target_modules so that it only matches the desired layer indices. However, this is tedious, and layers_pattern was introduced to avoid having to do this.
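To illustrate the matching behaviour described above (a minimal sketch with a hypothetical pattern, not the exact regex used in PEFT): requiring a prefix only matches module names that have something before "layer", while making the prefix optional also covers names that start with "layer" directly.

```python
import re

names = ["decoder.layer.0.attn.to_q", "layer.0.attn.to_q"]

# Sketch of the old behaviour: the pattern requires a prefix before "layer".
required_prefix = re.compile(r".*\.layer\.(\d+)\.")
# Sketch of the new behaviour: the prefix is optional, so a bare "layer.0..." also matches.
optional_prefix = re.compile(r"(?:.*\.)?layer\.(\d+)\.")

for name in names:
    print(name, bool(required_prefix.match(name)), bool(optional_prefix.match(name)))
# decoder.layer.0.attn.to_q True True
# layer.0.attn.to_q False True
```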
@Evan02580 I created a PR to improve the docs in #2157 and another PR to adapt the regex in #2158. For the latter, I'm unsure if we should proceed though, as technically this is a backwards-incompatible change.
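(As a side note, the workaround mentioned in the PR description above, i.e. crafting a regex for target_modules, could look roughly like the sketch below. The module-name layout and the exact pattern are assumptions for illustration; when target_modules is a string, PEFT treats it as a regex matched against the full module name.)

```python
from peft import LoraConfig

# Hypothetical regex that targets q/k/v projections only in layers 0 and 31,
# with or without a prefix before "layers" in the module name.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=r"(?:.*\.)?layers\.(?:0|31)\.self_attn\.(?:q_proj|k_proj|v_proj)",
    lora_dropout=0,
    bias="none",
)
```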
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
System Info
peft: 0.13.2
transformers: 4.43.1
Who can help?
@BenjaminBossan @sayakpaul
Information
Tasks
An officially supported task in the examples folder

Reproduction
When I tried to use LoraConfig, aiming to apply LoRA to the first and last layers, like:
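(The original snippet was not captured here; the following is a hedged reconstruction based on the config quoted in the reply above, assuming the model is loaded directly via LlamaModel.from_pretrained, as discussed in the comments above.)

```python
import torch
from transformers import LlamaModel
from peft import LoraConfig, get_peft_model

# Loading the bare LlamaModel yields module names like "layers.0.self_attn.q_proj",
# i.e. without a prefix before "layers".
model = LlamaModel.from_pretrained(
    "meta-llama/Meta-Llama-3-8B", torch_dtype=torch.bfloat16, trust_remote_code=True
)

lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj"],
    layers_to_transform=[0, 31],  # intended: only the first and last layer
    layers_pattern="layers",
    lora_dropout=0,
    bias="none",
)

# With peft 0.13.2, no modules are matched here because the layers_pattern regex
# expects a prefix before "layers", so this call errors out.
model = get_peft_model(model, lora_config)
```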
It raised the following error: […]
A similar thing happens if I use layers_pattern instead of target_modules (but it could be my misunderstanding of layers_pattern): […] This time, though, the problem should be with the default value of target_modules. However, when I use

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B", torch_dtype=torch.bfloat16, trust_remote_code=True)

instead, it works.

Expected behavior
I'm not sure whether this is a problem of LlamaModel. I am also confused about the use of layers_pattern, since the LoRA docs mention:

layers_to_transform: List of layers to be transformed by LoRA. If not specified, all layers in target_modules are transformed.
layers_pattern: Pattern to match layer names in target_modules, if layers_to_transform is specified. By default PeftModel will look at common layer pattern (layers, h, blocks, etc.), use it for exotic and custom models.

It should work with layers_to_transform; however, I didn't find a suitable approach to use it. Maybe some examples could be added to class LoraConfig(PeftConfig)?
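For reference, here is a minimal sketch of the setup that, per the report above, does work, assuming the model is loaded through AutoModelForCausalLM so that module names carry a prefix before "layers":

```python
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Loading via AutoModelForCausalLM yields module names like
# "model.layers.0.self_attn.q_proj", which the layers_pattern handling can match.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B", torch_dtype=torch.bfloat16, trust_remote_code=True
)

lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj"],
    layers_to_transform=[0, 31],  # apply LoRA only to the first and last decoder layer
    layers_pattern="layers",
    lora_dropout=0,
    bias="none",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```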