
add ia3 and adalora support #809

Merged
regisss merged 1 commit into main from adalora_ia3 on Apr 19, 2024
Conversation

sywangyi
Collaborator

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sywangyi
Collaborator Author

huggingface/peft#1540 fixed the AdaLoRA rank==0 issue when loading a model for inference.
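
For context, a minimal sketch of the inference path this fix targets (the model name and adapter path are illustrative, reused from the commands below, not taken from the PEFT issue itself):

from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
# AdaLoRA prunes per-module ranks during training, so some modules can end up
# with rank 0; loading such an adapter for inference failed before the fix.
model = PeftModel.from_pretrained(base, "./lora_out")
model.eval()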

@sywangyi
Collaborator Author

Updated PEFT to 0.10.0, which contains huggingface/peft#1540.

sywangyi force-pushed the adalora_ia3 branch 3 times, most recently from a77bbba to 3540ec8 on April 6, 2024 at 12:27
@sywangyi
Collaborator Author

sywangyi commented Apr 6, 2024

The AdaLoRA + DeepSpeed ZeRO-3 fine-tuning fix is in huggingface/peft#1625.

@sywangyi
Collaborator Author

sywangyi commented Apr 9, 2024

IA3 fine-tuning with "--use_flash_attention" needs the fix in huggingface/peft#1634, since q, k, and v must have the same data type in FusedSDPA.apply.
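
Schematically, the constraint looks like this (a sketch assuming the FusedSDPA kernel from habana_frameworks; this is not the actual diff from the PEFT PR):

from habana_frameworks.torch.hpex.kernels import FusedSDPA

def fused_attention(q, k, v, attn_mask=None, dropout_p=0.0):
    # IA3 rescales k and v with learned vectors; if those scales are float32
    # while q is bf16, k and v get promoted to float32. FusedSDPA.apply
    # requires q, k and v to share one dtype, so cast them back.
    k = k.to(q.dtype)
    v = v.to(q.dtype)
    return FusedSDPA.apply(q, k, v, attn_mask, dropout_p)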

regisss (Collaborator) left a comment


Regarding PEFT v0.10: there was an issue with higher memory consumption starting from PEFT v0.7 (#590).
Let me check whether this version works.

Review threads (resolved): optimum/habana/peft/layer.py, examples/language-modeling/run_lora_clm.py
@regisss
Collaborator

regisss commented Apr 12, 2024

We're still having an issue with Falcon-180b and more recent versions of PEFT. Let me see what we can do.

@sywangyi
Collaborator Author

Try #895; it should fix the Falcon-180b issue.

@regisss
Collaborator

regisss commented Apr 18, 2024

@sywangyi LGTM! Can you share a command for both IA3 and AdaLoRA so I can make sure I can run them, please?

@sywangyi
Collaborator Author

sywangyi commented Apr 18, 2024

Hi @regisss, you can use "--peft_type ia3" for IA3 and "--peft_type adalora" for AdaLoRA in the run_lora_clm.py command with the Alpaca dataset, for example:

python3 run_lora_clm.py \
  --model_name_or_path meta-llama/Llama-2-7b-hf \
  --dataset_name tatsu-lab/alpaca \
  --output_dir ./lora_out \
  --num_train_epochs 1 \
  --max_step 500 \
  --max_seq_len 2048 \
  --per_device_train_batch_size 1 \
  --per_device_eval_batch_size 10 \
  --evaluation_strategy epoch \
  --eval_delay 2 \
  --save_strategy no \
  --learning_rate 0.0018 \
  --warmup_ratio 0.03 \
  --lr_scheduler_type "cosine" \
  --logging_steps 1 \
  --dataset_concatenation \
  --do_train \
  --do_eval \
  --use_habana \
  --use_lazy_mode \
  --throughput_warmup_steps 3 \
  --lora_rank 4 \
  --lora_target_modules "q_proj" "v_proj" "k_proj" "o_proj" \
  --validation_split_percentage 4 \
  --peft_type ia3 \
  --pipelining_fwd_bwd \
  --bf16 \
  --attn_softmax_bf16 True 

python3 run_lora_clm.py \
  --model_name_or_path meta-llama/Llama-2-7b-hf \
  --dataset_name tatsu-lab/alpaca \
  --output_dir ./lora_out \
  --num_train_epochs 1 \
  --max_step 500 \
  --max_seq_len 2048 \
  --per_device_train_batch_size 1 \
  --per_device_eval_batch_size 10 \
  --evaluation_strategy epoch \
  --eval_delay 2 \
  --save_strategy no \
  --learning_rate 0.0018 \
  --warmup_ratio 0.03 \
  --lr_scheduler_type "cosine" \
  --logging_steps 1 \
  --dataset_concatenation \
  --do_train \
  --do_eval \
  --use_habana \
  --use_lazy_mode \
  --throughput_warmup_steps 3 \
  --lora_rank 4 \
  --lora_target_modules "q_proj" "v_proj" "k_proj" "o_proj" \
  --validation_split_percentage 4 \
  --peft_type adalora \
  --pipelining_fwd_bwd \
  --bf16 \
  --attn_softmax_bf16 True
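
For reference, a minimal sketch of the PEFT configurations these two flags map to (assuming the standard peft 0.10 API; the exact hyperparameters run_lora_clm.py passes internally may differ):

from transformers import AutoModelForCausalLM
from peft import AdaLoraConfig, IA3Config, TaskType, get_peft_model

target_modules = ["q_proj", "v_proj", "k_proj", "o_proj"]

# --peft_type ia3: learns per-channel scaling vectors, adds no new rank.
ia3_config = IA3Config(task_type=TaskType.CAUSAL_LM, target_modules=target_modules)

# --peft_type adalora: LoRA whose rank budget is reallocated during training,
# pruned from init_r down toward target_r.
adalora_config = AdaLoraConfig(
    task_type=TaskType.CAUSAL_LM,
    target_modules=target_modules,
    init_r=12,
    target_r=4,
)

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
model = get_peft_model(base, adalora_config)  # or ia3_config
model.print_trainable_parameters()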

regisss merged commit 7a1dce9 into main on Apr 19, 2024 (9 checks passed)
regisss deleted the adalora_ia3 branch on April 19, 2024 at 14:43
vivekgoe pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request on Jun 5, 2024
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

ghost pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request on Jun 5, 2024
* Add ia3 and adalora support (huggingface#809)
* Fix failure in Llama-70B-FSDP test
* Fix peft files
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Wang, Yi <yi.a.wang@intel.com>

astachowiczhabana pushed the same commit to HabanaAI/optimum-habana-fork on Jun 13, 2024 and Jun 24, 2024