We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No description provided.
The text was updated successfully, but these errors were encountered:
我们实验是在8张40G的A100上跑的,micro_batch_size是1。一张80G的A100应该可以跑,因为micro_batch_size可以开大,你可以试一下micro_batch_size开到4或者8,然后gradient_accumulation对应算一下就好,保证global_batch_size是64就行
Sorry, something went wrong.
No branches or pull requests
No description provided.
The text was updated successfully, but these errors were encountered: