I'm getting a CUDA out of memory error when training v5 on 1024-size images with a batch size of 4. I have trained v5 at image size 1024 with batch size 4 before without any error, but now I'm getting this error.
The weird part is that during training, GPU usage is around 10GB and it trains perfectly. The error is only thrown during validation.
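One way to confirm that the spike really comes from the validation pass is to record peak GPU memory separately for each phase. Below is a minimal sketch in plain PyTorch, not code from the YOLOv5 repository: `peak_gpu_memory_gb` and `validate_step` are hypothetical helper names, and it assumes the validation forward pass can be wrapped in a single callable.

```python
import torch


def peak_gpu_memory_gb(fn, *args, **kwargs):
    """Run fn and return (result, peak GPU memory allocated in GB during the call).

    Hypothetical helper: reports 0.0 when no CUDA device is available.
    """
    if not torch.cuda.is_available():
        return fn(*args, **kwargs), 0.0
    torch.cuda.synchronize()
    torch.cuda.reset_peak_memory_stats()  # start the peak counter fresh for this phase
    result = fn(*args, **kwargs)
    torch.cuda.synchronize()
    return result, torch.cuda.max_memory_allocated() / 1024 ** 3


def validate_step(model, images):
    """Hypothetical validation forward pass: eval mode, no autograd graph kept."""
    model.eval()
    with torch.no_grad():
        return model(images)


if __name__ == "__main__":
    # Tiny stand-in model and batch, only to show the call pattern.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = torch.nn.Conv2d(3, 4, kernel_size=3).to(device)
    images = torch.zeros(2, 3, 16, 16, device=device)
    _, peak = peak_gpu_memory_gb(validate_step, model, images)
    print(f"validation peak: {peak:.3f} GB")
```

Wrapping one training step and one validation step in the same helper and comparing the two numbers shows which phase is closer to the memory limit. If validation is the larger of the two, the validation batch size is the first thing to check.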
@awsaf49 I reviewed the code here. It seems that testing batch sizes were inadvertently affected by PR #2125. I've pushed a fix for this in #2587, though note that this should only affect memory issues during testing in Multi-GPU training.
@awsaf49 thanks for the info! If you believe you have a reproducible issue, we suggest you close this issue and raise a new one using the 🐛 Bug Report template, providing screenshots and a minimum reproducible example to help us better understand and diagnose your problem. Thank you!