-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training crashes after 7000 #235
Comments
I guess, it could be OOM. |
I also get this, not sure if OOM. It seems to only happen around the time iter 7000 saves. I was even running it in a docker container and it crashed the host machine. @nivibilla did you ever figure out what was causing this? |
Similar issue. showed killed after 7000 iterations for db/drjohnson |
I had the same error and it was due to OOM. When saving Gaussians there is a spike in CPU RAM usage. You're training with 423 images in Colab so I'm guessing the RAM consumption was already high. When it tried to save Gaussians, consumption must have spiked and caused an OOM. The quick fix is to not save Gaussians at iteration 7000 and avoid the spike. Only save at 30,000 iterations (or whatever your last iteration is) using the The better fix is given in #667. It decreases CPU RAM consumption and prevents this. |
I also ran into the same issue. In my case it was not OOM-related, though. I was able to solve the problem by changing the line
of
. |
I encountered the same problem. In my case, when training to 7000 iterations and save_ply, the code got stuck here and couldn't continue. The reason seems to be that there are too many
works for me, hoping it would be helpful. |
hi,
it gets to 7000 steps, outputs
^C
. Doesn't save a point cloud eitherThe text was updated successfully, but these errors were encountered: