[DPO] add KTO loss #1075
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Super fast addition @kashif! I've checked the equations and they look good to me 🔥
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Nice work @kashif! This is blazing fast. Maybe in a follow-up PR you can add a test example using Anthropic's HH dataset?
* add KTO loss
* fix docs
* Update trl/trainer/dpo_trainer.py

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* formatting
* add link to papers

---------

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Add the Kahneman-Tversky Optimization (KTO) loss function from https://github.com/ContextualAI/HALOs
Fixes #1072
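For readers skimming the diff, here is a minimal sketch of the paired KTO loss along the lines of the HALOs reference code this PR ports into the DPO trainer. The function name `kto_pair_loss` and the `beta` default are illustrative, not part of the PR; the inputs are assumed to be per-example summed log-probabilities of the chosen/rejected completions under the policy and reference models:

```python
import torch

def kto_pair_loss(
    policy_chosen_logps: torch.Tensor,
    policy_rejected_logps: torch.Tensor,
    reference_chosen_logps: torch.Tensor,
    reference_rejected_logps: torch.Tensor,
    beta: float = 0.1,
) -> torch.Tensor:
    """Paired KTO loss: score each completion against a batch-level KL
    baseline estimated from the opposite-label examples."""
    # Batch-level KL estimates between policy and reference, clamped at zero.
    chosen_kl = (policy_chosen_logps - reference_chosen_logps).mean().clamp(min=0)
    rejected_kl = (policy_rejected_logps - reference_rejected_logps).mean().clamp(min=0)

    # Per-example policy/reference log-ratios (the implicit DPO "reward").
    chosen_logratios = policy_chosen_logps - reference_chosen_logps
    rejected_logratios = policy_rejected_logps - reference_rejected_logps

    # Desirable completions are rewarded relative to the rejected-side KL
    # baseline; undesirable ones are penalized relative to the chosen-side one.
    return torch.cat(
        (
            1 - torch.sigmoid(beta * (chosen_logratios - rejected_kl)),
            1 - torch.sigmoid(beta * (chosen_kl - rejected_logratios)),
        ),
        0,
    )
```

In `DPOTrainer` the new loss is selected through the `loss_type` argument alongside the existing DPO loss variants, so existing setups can switch losses without changing the data pipeline.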