I've seen a few discussions about DPO (Direct Preference Optimization) for sd-scripts, specifically this and this.
However, there hasn't been any further movement on either, as far as I can tell. ORPO (Odds Ratio Preference Optimization) is related to DPO, and some even consider it superior.
I was recently browsing the various forks of sd-scripts and found the following repo, which appears to be under active development.
The branch doesn't function for me, erroring out with the following:
I think any kind of preference training would be interesting to explore, and I would be happy to see this kind of feature in sd-scripts. However, I haven't been able to contact the developer of that fork, so I'm raising awareness here in case there is a chance to gain traction.
After speaking with other AI/ML researchers and developers, I've been told that regular DPO training is "easy" to implement.
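For anyone curious what "regular" DPO looks like in a diffusion setting, here is a minimal sketch of the Diffusion-DPO objective (Wallace et al., 2023), written in NumPy for readability. This is only an illustration: the function name, argument layout, and the `beta` value are my own assumptions, not anything taken from sd-scripts or the fork above.

```python
import numpy as np

def diffusion_dpo_loss(model_err_w, model_err_l, ref_err_w, ref_err_l, beta=5000.0):
    """Diffusion-DPO loss computed from per-sample denoising MSEs.

    model_err_w / model_err_l : the trained model's noise-prediction MSE on the
                                preferred (w) and rejected (l) image of each pair.
    ref_err_w / ref_err_l     : the same MSEs under a frozen reference copy of
                                the model (e.g. the original checkpoint).
    """
    # The model is rewarded for lowering its error on preferred samples
    # (relative to the reference) more than on rejected samples.
    inside = -0.5 * beta * ((model_err_w - ref_err_w) - (model_err_l - ref_err_l))
    # -log sigmoid(x), computed stably as log(1 + exp(-x)) via logaddexp.
    return float(np.mean(np.logaddexp(0.0, -inside)))
```

In a real trainer you would compute these four MSEs per timestep batch (two forward passes through the trained UNet, two through the frozen reference UNet) and backprop through the `model_err_*` terms only; when all four errors are equal, the loss sits at log 2 and pushes in no direction.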