Adapter weights are unexpectedly cast to float16 after load_adapter #1090

Closed
2 of 4 tasks
hiyouga opened this issue Nov 7, 2023 · 1 comment

Comments

hiyouga (Contributor) commented Nov 7, 2023

System Info

Ubuntu + NVIDIA V100

transformers 4.34.1
peft 0.6.0

Who can help?

@pacman100 @younesbelkada

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder
  • My own task or dataset (give details below)

Reproduction

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

tok = AutoTokenizer.from_pretrained("llama2-7b")
# The base model is loaded in float16 (torch_dtype="auto" picks up the checkpoint dtype)
model = AutoModelForCausalLM.from_pretrained("llama2-7b", torch_dtype="auto", device_map="auto")

# Loading the first adapter keeps the LoRA weights in float32, as expected
adapter_model = PeftModel.from_pretrained(model, "lora_weight", is_trainable=True)
for name, param in adapter_model.named_parameters():
    print(name, param.dtype)

"""
base_model.model.model.layers.31.self_attn.v_proj.weight torch.float16
base_model.model.model.layers.31.self_attn.v_proj.lora_A.default.weight torch.float32 (correct)
base_model.model.model.layers.31.self_attn.v_proj.lora_B.default.weight torch.float32
"""

# Loading a second adapter unexpectedly casts all adapter weights to float16
adapter_model.load_adapter("another_lora_weight", "reward", is_trainable=True)
for name, param in adapter_model.named_parameters():
    print(name, param.dtype)

"""
base_model.model.model.layers.31.self_attn.v_proj.weight torch.float16
base_model.model.model.layers.31.self_attn.v_proj.lora_A.default.weight torch.float16 (incorrect)
base_model.model.model.layers.31.self_attn.v_proj.lora_A.reward.weight torch.float16
base_model.model.model.layers.31.self_attn.v_proj.lora_B.default.weight torch.float16
base_model.model.model.layers.31.self_attn.v_proj.lora_B.reward.weight torch.float16
"""

In addition, add_adapter triggers the same problem in peft 0.6.0; see the sketch below.
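
A minimal sketch of the add_adapter variant, assuming an illustrative LoraConfig (the hyperparameters and the "reward_v2" adapter name are not from the original report):

from peft import LoraConfig

# Illustrative config; r, lora_alpha, and target_modules are assumptions
lora_config = LoraConfig(
    task_type="CAUSAL_LM",
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
)
adapter_model.add_adapter("reward_v2", lora_config)
for name, param in adapter_model.named_parameters():
    if "lora_" in name:
        print(name, param.dtype)  # per the report above, also shows torch.float16 on peft 0.6.0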

Expected behavior

The data type of the adapter weights should remain float32 after load_adapter.
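
Until this is fixed, a possible workaround (a sketch, not taken from the original report) is to cast the trainable adapter weights back to float32 after loading:

import torch

# Workaround sketch (assumption, not from the report): re-cast any trainable
# parameters that were downcast to float16 back to float32. The frozen float16
# base weights are left untouched because they have requires_grad=False.
for param in adapter_model.parameters():
    if param.requires_grad and param.dtype == torch.float16:
        param.data = param.data.float()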

github-actions bot commented Dec 7, 2023

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

hiyouga closed this as not planned (won't fix / can't repro / duplicate / stale) on Dec 13, 2023