difference between get_peft_model and from_pretrained #470

Closed
nuoma opened this issue May 19, 2023 · 3 comments


nuoma commented May 19, 2023

Hi, when I'm training LLaMA with LoRA, I seem to get different results depending on whether I load the LoRA weights via get_peft_model or from_pretrained. I can't really tell why; both can run inference successfully, but with vastly different results.

Could someone be so kind as to tell me which is the correct way of doing this? Many thanks!
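
For context, a minimal sketch of the two paths being compared (the model name, adapter path, and LoRA hyperparameters below are placeholders, not from this issue): get_peft_model attaches a freshly initialized adapter to the base model, while PeftModel.from_pretrained restores previously trained adapter weights, which would explain divergent inference results if the former is used to "load" a checkpoint.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, PeftModel, get_peft_model

# Path 1: get_peft_model wraps the base model with a *freshly initialized* LoRA adapter.
# It does not load any trained adapter weights, so inference reflects an untrained adapter.
base_model = AutoModelForCausalLM.from_pretrained("base_model_name")  # placeholder name
lora_config = LoraConfig(r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"])
untrained_lora_model = get_peft_model(base_model, lora_config)

# Path 2: PeftModel.from_pretrained restores adapter weights previously saved with
# model.save_pretrained(...), so inference uses the trained LoRA weights.
base_model = AutoModelForCausalLM.from_pretrained("base_model_name")
trained_lora_model = PeftModel.from_pretrained(base_model, "lora_ckpt_path")  # placeholder path
```

If the goal is to run inference with an already-trained adapter, the second path is the one that actually loads the saved weights.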

yyqi17 commented May 23, 2023

Hi nuoma, could you please show me your code for saving & loading checkpoints? I'm fine-tuning LLaMA with LoRA and seem to run into some issues when running inference on my checkpoint (saved with model.save_pretrained() and loaded with PeftModel.from_pretrained(base_model, lora_ckpt_path)). Thanks a lot!
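
For reference, a sketch of the save/load round trip described here, assuming `model` is the PEFT-wrapped model after training and the paths are placeholders:

```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

# After training, save only the LoRA adapter weights (not the full base model):
# model.save_pretrained("lora_ckpt_path")

# At inference time, reload the base model and attach the saved adapter:
base_model = AutoModelForCausalLM.from_pretrained("base_model_name")  # placeholder name
model = PeftModel.from_pretrained(base_model, "lora_ckpt_path")        # placeholder path
model.eval()
```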

ingo-m commented May 26, 2023

Possibly related? #503 🤔

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
