GPTQ Integration #771

SunMarc · 2023-08-01T22:49:05Z

What does this PR do ?

This PR adds the possibility to train lora + adalora adapters on top of GPTQ quantized model.

convert to peft model for training

causal_lm_model_id = "marcsun13/opt-350m-gptq-4bit"
tokenizer  = AutoTokenizer.from_pretrained(causal_lm_model_id)
model = AutoModelForCausalLM.from_pretrained(
    causal_lm_model_id,
    torch_dtype=torch.float16,
    device_map="auto"
)
model = prepare_model_for_kbit_training(model)
config = LoraConfig(
      r=16,
      lora_alpha=32,
      target_modules=["q_proj", "v_proj"],
      lora_dropout=0.05,
      bias="none",
      task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)

save adapters after training

model.cpu().save_pretrained(save_dir)

load saved adapters

model = AutoModelForCausalLM.from_pretrained(
    causal_lm_model_id,
    torch_dtype=torch.float16,
    device_map="auto"
)
model = PeftModel.from_pretrained(model ,save_dir)# load saved adapters

to do

finetune llama2
merge after transformers PR (the doc PR test will then be fixed)

HuggingFaceDocBuilderDev · 2023-08-01T22:53:27Z

The documentation is not available anymore as the PR was closed or merged.

pacman100

Hello @SunMarc, Thank you for adding AutoGPTQ support 🚀. Left few comments

src/peft/tuners/lora.py

BenjaminBossan

Thanks a lot. This PR looks really good, I only have minor comments. I don't have any experience with GPTQ itself (yet), so I cannot really judge the more technical parts of the implementation.

A more general question: Does GPTQ generally not work with IA³ or is it just a matter of implementing it later?

src/peft/tuners/adalora.py

src/peft/tuners/ia3.py

src/peft/tuners/lora.py

younesbelkada

Looking great, thanks a lot @SunMarc ! I left one comment
We can add docs and an example script later in a follow up PR
Thanks! 🚀

docker/peft-gpu/Dockerfile

BenjaminBossan

The PR is looking already quite good from my POV. Unfortunately, the tests are still failing, I assume because they require the corresponding changes to land in transformers.

I have a few comments, but none of them are deal breakers.

src/peft/tuners/adalora.py

BenjaminBossan · 2023-08-10T08:09:00Z

src/peft/tuners/lora.py

+        LoraLayer.__init__(
+            self, in_features=quant_linear_module.infeatures, out_features=quant_linear_module.outfeatures
+        )
+        self.quant_linear_module = quant_linear_module


This is an interesting deviation from how the other lora layers are implemented. Here, we pass the original layer (quant_linear_module) and use it under the hood. For the normal Linear lora layer, we don't get the layer, instead basically creating a new linear layer:

# in __init__ nn.Linear.__init__(self, in_features, out_features, **kwargs) # in forward result = F.linear(x, transpose(self.weight, self.fan_in_fan_out), bias=self.bias)

I actually prefer the solution here but wonder if there was a specific reason why this approach was not taken originally. If so, would that same reason apply here or are we good with having two different approaches? Hopefully, the others can clarify this.

I did that because I wanted to put the new QuantLinear class in the same place as the others for conformity. If we want to go with the same approach, we will need to create this new class in a function so that the auto_gptq import is protected ( to avoid circular import as it is also importing peft ) . LMK what you think about this solution and I will add it in another PR.

tests/test_gpu_examples.py

SunMarc · 2023-08-10T20:19:19Z

Unfortunately, the tests are still failing, I assume because they require the corresponding changes to land in transformers.

I don't want to break the tests so i hardcoded the const value. I will change them back when we will have the next release of transformers

SunMarc added 2 commits July 31, 2023 23:02

add gptq lora

e82c460

fix peft gptq

63eb082

SunMarc requested review from younesbelkada and pacman100 August 1, 2023 22:49

SunMarc added 2 commits August 1, 2023 23:14

fix condition

572341c

fix test

f9f3a0c

pacman100 reviewed Aug 2, 2023

View reviewed changes

src/peft/tuners/lora.py Outdated Show resolved Hide resolved

src/peft/tuners/lora.py Show resolved Hide resolved

BenjaminBossan approved these changes Aug 2, 2023

View reviewed changes

src/peft/tuners/adalora.py Outdated Show resolved Hide resolved

src/peft/tuners/ia3.py Outdated Show resolved Hide resolved

src/peft/tuners/lora.py Outdated Show resolved Hide resolved

src/peft/tuners/lora.py Outdated Show resolved Hide resolved

SunMarc added 3 commits August 2, 2023 13:44

remove unused weights

27d8d08

check type

8630fb5

style

bdef595

SunMarc requested a review from pacman100 August 2, 2023 14:29

younesbelkada approved these changes Aug 2, 2023

View reviewed changes

docker/peft-gpu/Dockerfile Show resolved Hide resolved

SunMarc added 5 commits August 2, 2023 17:41

change attribute

4a465f1

remove print

b94bb13

add exllama

8026c17

Merge remote-tracking branch 'upstream/main' into peft_gptq

91685f5

make style

d760343

SunMarc requested a review from BenjaminBossan August 9, 2023 23:20

BenjaminBossan reviewed Aug 10, 2023

View reviewed changes

SunMarc added 3 commits August 10, 2023 18:51

refactor + fix tests

fe01f18

remove print

0182a2d

remove dep on transformers

caf4922

SunMarc merged commit a916465 into huggingface:main Aug 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPTQ Integration #771

GPTQ Integration #771

SunMarc commented Aug 1, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 1, 2023 •

edited

Loading

pacman100 left a comment

BenjaminBossan left a comment

younesbelkada left a comment

BenjaminBossan left a comment

BenjaminBossan Aug 10, 2023

SunMarc Aug 10, 2023 •

edited

Loading

SunMarc commented Aug 10, 2023

GPTQ Integration #771

GPTQ Integration #771

Conversation

SunMarc commented Aug 1, 2023 • edited Loading

What does this PR do ?

convert to peft model for training

save adapters after training

load saved adapters

to do

HuggingFaceDocBuilderDev commented Aug 1, 2023 • edited Loading

pacman100 left a comment

Choose a reason for hiding this comment

BenjaminBossan left a comment

Choose a reason for hiding this comment

younesbelkada left a comment

Choose a reason for hiding this comment

BenjaminBossan left a comment

Choose a reason for hiding this comment

BenjaminBossan Aug 10, 2023

Choose a reason for hiding this comment

SunMarc Aug 10, 2023 • edited Loading

Choose a reason for hiding this comment

SunMarc commented Aug 10, 2023

SunMarc commented Aug 1, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 1, 2023 •

edited

Loading

SunMarc Aug 10, 2023 •

edited

Loading