[KTO] prevent nans from appearing in metrics #1386

kawine · 2024-03-01T09:45:20Z

This PR addresses the issue of NaNs appearing in KTO metric logs (even when training works). See here: #1342 (comment)

This was fixed by doing an all_gather over lists of [nan] instead of empty lists when a particular metric has no entries for a given microbatch (for some reason, accelerate, unlike pytorch, cannot all_gather over empty tensors).

cc @kashif

…e batch_size losses

add reference to paper Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

HuggingFaceDocBuilderDev · 2024-03-01T10:42:36Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* add warning for imbalanced data * update documentation * update script commands to be same as in dpo * use batch_size KL examples and batch_size target examples to calculate batch_size losses * fix deepspeed issue * speed up forward with no_grad for KL * add some removed metrics * Update trl/trainer/kto_trainer.py * Update trl/trainer/kto_trainer.py * Update trl/trainer/kto_trainer.py add reference to paper Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * add more detailed comments * convert assert to ValueError * Update kto_trainer.py * precommit formatting * remove nans in metrics by gathering across machines * fix formatting --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

kawine and others added 30 commits February 24, 2024 18:19

add warning for imbalanced data

6ee3be4

update documentation

22dd810

update script commands to be same as in dpo

8d14930

use batch_size KL examples and batch_size target examples to calculat…

8a490af

…e batch_size losses

fix deepspeed issue

f826600

speed up forward with no_grad for KL

688ed6c

Merge branch 'huggingface:main' into main

587517b

add some removed metrics

e128f09

Update trl/trainer/kto_trainer.py

2d860b8

Update trl/trainer/kto_trainer.py

48d25ff

Update trl/trainer/kto_trainer.py

392bcc0

add reference to paper Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

Update trl/trainer/kto_trainer.py

a42049f

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

5696814

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

000d5d8

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

2738d1f

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

d7f63c5

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

824da55

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

4399af4

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

69094be

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

73f7ed7

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

5b95aca

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

Update trl/trainer/kto_trainer.py

3102901

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

add more detailed comments

ca68f24

convert assert to ValueError

94fb375

Update kto_trainer.py

8f7e788

precommit formatting

ed19ed5

Merge branch 'main' of https://github.com/kawine/trl into main

310bd97

Merge branch 'huggingface:main' into main

639f4de

remove nans in metrics by gathering across machines

ee7d6a4

fix formatting

7ae95c2

kashif approved these changes Mar 1, 2024

View reviewed changes

kashif merged commit 067db75 into huggingface:main Mar 1, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[KTO] prevent nans from appearing in metrics #1386

[KTO] prevent nans from appearing in metrics #1386

kawine commented Mar 1, 2024

HuggingFaceDocBuilderDev commented Mar 1, 2024

[KTO] prevent nans from appearing in metrics #1386

[KTO] prevent nans from appearing in metrics #1386

Conversation

kawine commented Mar 1, 2024

HuggingFaceDocBuilderDev commented Mar 1, 2024