Questions about training code for UltraRM/UltraCM #4
Comments
+1. Are there any plans for this issue?
Thanks for your interest! For reward modeling, we use the code in this repo: https://github.com/Dahoas/reward-modeling. I also recommend HuggingFace TRL for an easy implementation: https://huggingface.co/docs/trl/index
Thank you!
Great work, and thanks for the contribution! May I ask whether you plan to release the training code for UltraRM/UltraCM?