-
Notifications
You must be signed in to change notification settings - Fork 811
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to prefetch data? #660
Comments
which dataset are you using now? We have re-written language modeling and text classification datasets to make them more compatible with |
@speedcell4 lmk if you're still seeing these bottlenecks. There's also dataloader v2 coming out soon which should have some improvements on data loading perf. |
a "coming out soon" without reference + closing issue without confirming |
@speedcell4 we are currently going through the backlog of issues and responding to/closing out issues that we think are legacy and may no longer be relevant as we've made a lot of updates to our library over the last couple of years. Feel free to keep the issue open if you're still running into the performance bottlenecks. |
Currently, the bottleneck of my model training is on the data loading part, is there any example about how to prefetch data? Like the
pin_memory
andnum_workers
arguments oftorch.utils.data.DataLoader
The text was updated successfully, but these errors were encountered: