Do we have a plan for an RFT feature? #80

zjrwtx · 2024-12-08T09:36:49Z

Feature request

https://openai.com/form/rft-research-program/

Motivation

Enhanced domain expertise: Reinforcement Fine-Tuning (RFT) optimizes a model on high-quality task datasets so that it performs more accurately on complex tasks in a specific domain, raising its capability from "high school level" to "doctoral-level expert".

Reduced training data requirements: RFT lets developers fine-tune models on datasets of tens to thousands of high-quality tasks, so significant performance gains are possible even with limited data (sometimes just a few dozen samples).

Enhanced reasoning: Unlike traditional fine-tuning, reinforcement fine-tuning does not simply make the model "remember the answer". Instead, it trains the model to reason within a specific domain to find the right answer, which improves its ability to solve similar problems.

Your contribution

I think I can collaborate with you on this.

YanSong97 · 2024-12-09T19:23:38Z

Yes, absolutely! We are adding RFT to our TODO list with high priority, and we welcome any form of collaboration!

zjrwtx · 2024-12-11T05:19:20Z

Got it, thanks!

kargarisaac · 2024-12-25T18:13:59Z

Is anyone working on this? I'm interested in contributing.

zjrwtx added the enhancement (New feature or request) label Dec 8, 2024

zjrwtx closed this as completed Dec 11, 2024
