How to add an adapter to a quantized model without peft? #660

Closed · mkgs210 opened this issue Mar 20, 2024 · 1 comment · Fixed by #663
Labels: enhancement (New feature or request)

mkgs210 commented Mar 20, 2024

Environment info

  • adapters version: 0.1.1
  • Platform: Windows 10
  • Python version: 3.11.0
  • PyTorch version (GPU?): 2.1.0
  • Using GPU in script?: Yes
  • Using distributed or parallel set-up in script?: No

Information

Model I am using: ai-forever/rugpt3large_based_on_gpt2

Language I am using the model on: Russian

Adapter setup I am using (if any): BnConfig, SeqBnConfig, DoubleSeqBnConfig, PrefixTuningConfig, LoRAConfig, IA3Config, PromptTuningConfig, MAMConfig, UniPELTConfig

The problem arises when using: the official example scripts (the adapters Text_Generation_Training example, modified as described below)

The task I am working on is:

  • an official GLUE/SQuAD task: text generation
  • my own task or dataset:

To reproduce

Steps to reproduce the behavior:

  1. Use the Text_Generation_Training adapters example
  2. Load the model in 4-bit or 8-bit
  3. Delete init(model) because it does not work here, and replace AutoModelForCausalLM with AutoAdapterModel
  4. Run trainer.train() (a reproduction sketch is given below)
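A minimal sketch of these steps (assuming bitsandbytes is installed and a CUDA GPU is available; the adapter name and the SeqBnConfig choice are just examples, any of the configs listed above hits the same error):

```python
import torch
from transformers import BitsAndBytesConfig
from adapters import AutoAdapterModel, SeqBnConfig

# Load the model quantized to 4-bit (8-bit works the same way via load_in_8bit).
model = AutoAdapterModel.from_pretrained(
    "ai-forever/rugpt3large_based_on_gpt2",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_compute_dtype=torch.float16,
    ),
    device_map="auto",
)

# Attach a bottleneck adapter and mark it as the only trainable part.
model.add_adapter("bottleneck", config=SeqBnConfig())
model.train_adapter("bottleneck")

# Handing this model to a Trainer and calling trainer.train() raises the
# ValueError quoted below.
```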

Expected behavior

Fine-tuning the model

Real behavior

I'm trying to add an adapter to a quantized model. I would like to use not only the LoRA adapter available in PEFT, but also the other adapter types listed above. However, as soon as I start training, this error appears:

ValueError: You cannot perform fine-tuning on purely quantized models. Please attach trainable adapters on top of the quantized model to correctly perform fine-tuning. Please see: https://huggingface.co/docs/transformers/peft for more details

I tried to use PEFT helpers such as prepare_model_for_kbit_training, but I couldn't add a non-PEFT adapter that way (see the sketch below).
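For completeness, the PEFT route I tried looks roughly like this (a sketch; prepare_model_for_kbit_training and get_peft_model are real peft functions, but they only cover PEFT's own adapter types):

```python
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Prepare the k-bit quantized model for training (casts norms, enables
# input gradients).
model = prepare_model_for_kbit_training(model)

# This works for LoRA, but there is no analogous way to attach the
# adapters library's BnConfig, PrefixTuningConfig, etc. on top.
peft_model = get_peft_model(model, LoraConfig(task_type="CAUSAL_LM"))
```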

@mkgs210 mkgs210 added the bug Something isn't working label Mar 20, 2024
@calpt calpt added enhancement New feature or request and removed bug Something isn't working labels Mar 31, 2024
calpt (Member) commented Mar 31, 2024

Hey @mkgs210, training on quantized models in the style of e.g. QLoRA is not currently supported by the released version of adapters. There's a WIP pull request adding this support though: #663.

This PR also adds a Colab notebook demoing fine-tuning of Llama 2 with QLoRA and adapters: https://github.com/calpt/adapter-transformers/blob/dev/qlora/notebooks/QLoRA_Llama2_Finetuning.ipynb. The rough flow is sketched below.
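Roughly, the flow the notebook demonstrates looks like this (a sketch; the exact API may still change before #663 is merged, and the checkpoint name is illustrative):

```python
import torch
import adapters
from adapters import LoRAConfig
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Load the base model quantized to 4-bit NF4, as in QLoRA.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
    device_map="auto",
)

adapters.init(model)  # enable adapters support on the plain HF model
model.add_adapter("qlora", config=LoRAConfig())
model.train_adapter("qlora")  # base weights stay frozen; only LoRA weights train
```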

Would love for you to test this out and help us add support for training quantized models to adapters, thanks!

@calpt calpt self-assigned this Mar 31, 2024
@calpt calpt linked a pull request Mar 31, 2024 that will close this issue