-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RMS PROP 8bit support #1336
RMS PROP 8bit support #1336
Comments
Hi @NickWithBotronics ! |
I love you and the HF team y'all are so awesome! Thank you! Also on a side note, there is a 4bit adamnw optimizer that claims the same performance as 32bit adamnw I don’t know how well these lower bit optimizers work from first hand use, but they have a awesome paper, and code. “ https://github.com/thu-ml/low-bit-optimizers ” . Maybe it’s worth looking into integrating into the HF trainer. |
Current only rms prop is supported by trl for DPO training. But bitsandbytes contains 8bit RMSprop that could be used to save on vram, making DPO QLORA on 7b’s fit better without going into shared memory, on 24 gig cards. Could we work on an implementation of 8bit rms prop into trl?
The text was updated successfully, but these errors were encountered: