-
Notifications
You must be signed in to change notification settings - Fork 233
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mistral kills the process by taking too many RAM #1458
Comments
Mistral loads in The workaround to use the dtype set using
Let me know if this lowers the RAM usage. This will be fixed in the next release. |
Thanks for the bug! Just synced up with @tirthasheshpatel. We want to change two things here
These are both simple but important fixes, we should have a patch fix for this in a couple days. Thanks @deep-diver! |
I was running Mistral model on Colab environment w/ A100(40GB) and 80GB RAM. I loaded up the model successfully. However, when generate text, the RAM usage hit the peak, and the runtime got restarted.
Is this an expected behavior? or could there be bugs?
The text was updated successfully, but these errors were encountered: