Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

do we hava a plan feature about Rft? #80

Closed
zjrwtx opened this issue Dec 8, 2024 · 3 comments
Closed

do we hava a plan feature about Rft? #80

zjrwtx opened this issue Dec 8, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@zjrwtx
Copy link
Contributor

zjrwtx commented Dec 8, 2024

Feature request

https://openai.com/form/rft-research-program/

Motivation

Enhance model professionalism: Reinforcement Fine-Tuning (RFT) is designed to optimize models through high-quality task data sets to make the model perform more accurately in complex tasks in a specific domain, thereby elevating the model from "high school level" to "doctoral level expert" capability.

Reduced training data requirements: RFT technology allows developers to fine-tune models using data sets from tens to thousands of high-quality tasks, meaning significant performance gains can be achieved even with limited data (sometimes just a few dozen samples).

Enhanced reasoning: Unlike traditional fine-tuning, reinforcement fine-tuning does not simply make the model "remember the answer", but by training the model to learn to reason in a specific domain, to find the right answer, thereby improving the model's ability to solve similar problems.

Your contribution

i guess i can cooperate with you guys

@zjrwtx zjrwtx added the enhancement New feature or request label Dec 8, 2024
@YanSong97
Copy link
Collaborator

Yes absolutely! We add RFT to our TODO list with high priority! We welcome any form of collaboration!

@zjrwtx
Copy link
Contributor Author

zjrwtx commented Dec 11, 2024

got it,thanks!

@zjrwtx zjrwtx closed this as completed Dec 11, 2024
@kargarisaac
Copy link

Is anyone working on this? I'm interested to contribute.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants