
[DPO] Merge initial peft model if trainer has a peft_config #956

Merged: 2 commits into huggingface:main from issue-742 on Nov 6, 2023

Conversation

@kashif (Collaborator) commented Nov 5, 2023

fixes #742

We merge the initial peft adapter if we are training with a peft_config.

Co-authored-by: Shoaib Burq <saburq@gmail.com>
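
Conceptually, the change does something like the following. This is a minimal sketch rather than the exact trl code, and `prepare_model` is an illustrative name:

```python
# Sketch of the behavior this PR adds: if the incoming model already
# carries a PEFT adapter and the trainer also receives a peft_config,
# merge the existing adapter into the base weights before attaching
# the new one, so the initial (e.g. SFT) weights are not dropped.
from peft import PeftModel, get_peft_model

def prepare_model(model, peft_config=None):
    if peft_config is not None:
        if isinstance(model, PeftModel):
            # Fold the initial adapter into the base model.
            model = model.merge_and_unload()
        # Attach the fresh adapter defined by peft_config.
        model = get_peft_model(model, peft_config)
    return model
```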
@HuggingFaceDocBuilderDev commented Nov 5, 2023

The documentation is not available anymore as the PR was closed or merged.

@kashif (Collaborator, Author) commented Nov 5, 2023

@Elfsong can you check whether I am testing the failure case you hit in the test?

@Elfsong commented Nov 5, 2023

> @Elfsong can you check whether I am testing the failure case you hit in the test?

@kashif Sure, I'm testing it now. It doesn't work on my side so far: the LoRA weights were not added on top of the base model. Not sure if the problem is in my code.


I’m currently working on this demo FYI:
https://github.com/huggingface/trl/tree/main/examples/research_projects/stack_llama_2/scripts
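
For what it's worth, one rough way to check whether the LoRA weights actually landed in the base model is to compare a base weight tensor before and after merging. This is a sketch with placeholder names: `path/to/sft_lora_adapter` is hypothetical, and the check assumes the adapter targets `c_attn`:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in base checkpoint
layer = "transformer.h.0.attn.c_attn"
before = base.get_submodule(layer).weight.detach().clone()

# "path/to/sft_lora_adapter" is a placeholder for a trained adapter.
peft_model = PeftModel.from_pretrained(base, "path/to/sft_lora_adapter")
merged = peft_model.merge_and_unload()
after = merged.get_submodule(layer).weight

# A trained adapter (non-zero lora_B) must perturb the base weights;
# a freshly initialized one leaves them unchanged, since lora_B starts at zero.
print("weights changed after merge:", not torch.equal(before, after))
```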

@Elfsong left a comment

LGTM! Thank you!

@younesbelkada (Contributor) left a comment

LGTM - given that this is a common scenario (people can SFT their model with peft and then use that model for DPO)! Thanks @kashif!
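
As a sketch of that scenario (the checkpoint names, toy dataset, and hyperparameters below are placeholders, not from this PR): the SFT adapter-carrying model is handed straight to `DPOTrainer` together with a fresh `peft_config`, and with this PR the trainer merges the SFT adapter first so the DPO adapter trains on top of the SFT-tuned weights.

```python
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from peft import LoraConfig, PeftModel
from trl import DPOTrainer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

base = AutoModelForCausalLM.from_pretrained("gpt2")
# "path/to/sft_lora_adapter" is a placeholder for the adapter produced by SFT.
model = PeftModel.from_pretrained(base, "path/to/sft_lora_adapter")

# A toy preference dataset with the columns DPOTrainer expects.
train_dataset = Dataset.from_dict({
    "prompt": ["What is the capital of France?"],
    "chosen": [" Paris."],
    "rejected": [" London."],
})

trainer = DPOTrainer(
    model,  # still carries the SFT adapter; the trainer merges it first
    args=TrainingArguments(
        output_dir="dpo-out",
        per_device_train_batch_size=1,
        remove_unused_columns=False,
    ),
    beta=0.1,
    train_dataset=train_dataset,
    tokenizer=tokenizer,
    max_length=512,
    max_prompt_length=128,
    peft_config=LoraConfig(r=8, lora_alpha=16, task_type="CAUSAL_LM"),
)
trainer.train()
```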

@younesbelkada younesbelkada merged commit 6c6ff24 into huggingface:main Nov 6, 2023
8 checks passed
@kashif kashif deleted the issue-742 branch November 6, 2023 08:52
lapp0 pushed a commit to lapp0/trl that referenced this pull request on May 10, 2024:

[DPO] Merge initial peft model if trainer has a peft_config (huggingface#956)

* failing test

Co-authored-by: Shoaib Burq <saburq@gmail.com>

* merge initial peft model
Successfully merging this pull request may close these issues.

dpo training with Lora can not save fine-tuned weights (#742)