Collaborator

@kashif kashif commented Jul 31, 2025

What does this PR do?

Add support for the AlphaPO method to the CPOTrainer, instead of adding a dedicated trainer as in #3776.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@kashif kashif mentioned this pull request Aug 1, 2025
5 tasks
@kashif kashif changed the title from "[CPO] Add AlphaPO method in CPOTrainer" to "[CPO] Add AlphaPO method via CPOTrainer" Aug 1, 2025
@kashif kashif requested a review from qgallouedec August 1, 2025 14:14
Member

@qgallouedec qgallouedec left a comment

lgtm!

@qgallouedec qgallouedec changed the title from "[CPO] Add AlphaPO method via CPOTrainer" to "🗿 [CPO] Add AlphaPO method via CPOTrainer" Aug 17, 2025
@qgallouedec qgallouedec merged commit b971844 into huggingface:main Aug 17, 2025
10 checks passed
@kashif kashif deleted the alphaPO-cpo-integration branch August 17, 2025 09:23
LuisVasquezBSC pushed a commit to langtech-bsc/trl that referenced this pull request Aug 28, 2025
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
SamY724 pushed a commit to SamY724/trl that referenced this pull request Sep 6, 2025
Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
@qgallouedec qgallouedec mentioned this pull request Oct 30, 2025
54 tasks