docs: add initial version of docs for `PPOTrainer` #665

davidberenstein1957 · 2023-08-20T17:58:38Z

As discussed in #623, I am proposing more elaborate docs for the PPOTrainer.

Closes #623

HuggingFaceDocBuilderDev · 2023-09-01T13:10:08Z

The documentation is not available anymore as the PR was closed or merged.

lvwerra

Thanks a lot for the docs contribution! The preview is also working now :) The PR looks in pretty good shape to me! I added some small suggestions here and there. I'll also let @vwxyzjn and @younesbelkada have a look.

docs/source/ppo_trainer.mdx

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

- specified reference to reward model - added batched generator - added line of saving model - remove reference model

davidberenstein1957 · 2023-09-08T12:42:32Z

@lvwerra I already processed your comments and suggestions.

lvwerra

Looks good to me, some last small nits only!

docs/source/ppo_trainer.mdx

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

younesbelkada

This is very cool ! Thanks a lot for your great effort on this!

* docs: add initial version of docs for `PPOTrainer` * Apply suggestions from code review Leandro Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * updated docs based on feedback leandro - specified reference to reward model - added batched generator - added line of saving model - remove reference model * Apply suggestions from code review Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> --------- Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

docs: add initial version of docs for PPOTrainer

297b516

younesbelkada mentioned this pull request Sep 6, 2023

Documentation and examples for DPOTrainer #524

Closed

lvwerra reviewed Sep 8, 2023

View reviewed changes

davidberenstein1957 and others added 3 commits September 8, 2023 13:58

Apply suggestions from code review Leandro

2f2c3ee

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

Apply suggestions from code review

a2c23db

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

updated docs based on feedback leandro

4c8225b

- specified reference to reward model - added batched generator - added line of saving model - remove reference model

lvwerra approved these changes Sep 8, 2023

View reviewed changes

docs/source/ppo_trainer.mdx Outdated Show resolved Hide resolved

docs/source/ppo_trainer.mdx Outdated Show resolved Hide resolved

lvwerra requested review from vwxyzjn and younesbelkada September 8, 2023 14:04

Apply suggestions from code review

1a1f290

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

younesbelkada approved these changes Sep 11, 2023

View reviewed changes

younesbelkada merged commit 3f7710a into huggingface:main Sep 14, 2023
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add initial version of docs for `PPOTrainer` #665

docs: add initial version of docs for `PPOTrainer` #665

davidberenstein1957 commented Aug 20, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 1, 2023 •

edited

Loading

lvwerra left a comment

davidberenstein1957 commented Sep 8, 2023

lvwerra left a comment

younesbelkada left a comment

docs: add initial version of docs for PPOTrainer #665

docs: add initial version of docs for PPOTrainer #665

Conversation

davidberenstein1957 commented Aug 20, 2023 • edited Loading

HuggingFaceDocBuilderDev commented Sep 1, 2023 • edited Loading

lvwerra left a comment

Choose a reason for hiding this comment

davidberenstein1957 commented Sep 8, 2023

lvwerra left a comment

Choose a reason for hiding this comment

younesbelkada left a comment

Choose a reason for hiding this comment

docs: add initial version of docs for `PPOTrainer` #665

docs: add initial version of docs for `PPOTrainer` #665

davidberenstein1957 commented Aug 20, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 1, 2023 •

edited

Loading