Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deprecate RLHF #592

Merged
merged 10 commits into from
Feb 1, 2024
Merged

deprecate RLHF #592

merged 10 commits into from
Feb 1, 2024

Conversation

pascal-pfeiffer
Copy link
Collaborator

@pascal-pfeiffer pascal-pfeiffer commented Jan 30, 2024

Starting to deprecate RLHF in favor of DPO/IPO optimization. RLHF will be fully removed in a future release.
begins: #545

Removal of train pipeline and UI components
Old experiments are still viewable

closes #183 as not planned
closes #175 as not planned
closes #529 as not planned

removal on train pipeline and UI components
Old experiments are still viewable
@pascal-pfeiffer pascal-pfeiffer marked this pull request as ready for review January 30, 2024 10:02
README.md Outdated
@@ -53,6 +53,7 @@ Using CLI for fine-tuning LLMs:

## What's New

- [PR 592](https://github.com/h2oai/h2o-llmstudio/pull/592) Starting to deprecate RLHF in favor of DPO/IPO optimization. Training is no disabled, but old experiments are still viewable. RLHF will be fully removed in a future release.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Training is disabled

Copy link
Contributor

@maxjeblick maxjeblick left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks a lot!

@pascal-pfeiffer pascal-pfeiffer merged commit 340df38 into main Feb 1, 2024
5 checks passed
@pascal-pfeiffer pascal-pfeiffer deleted the pp/deprecate_RLHF branch February 1, 2024 14:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants