RuntimeError: CUDA error: out of memory #52

RuiqingTang · 2024-04-08T05:01:33Z

My GPU is 4060 with 8GB of VRAM. Is it too small? However, I saw someone using a 3090 and still encountering errors.
#38
Here is the info:
Training progress: 12%|█▎ | 5000/40000 [06:40<48:38, 11.99it/s, Loss=0.0467618]Traceback (most recent call last):
File "E:\Projects\Python_projects\3D_Vsion\Deformable-3D-Gaussians\train.py", line 274, in
training(lp.extract(args), op.extract(args), pp.extract(args), args.test_iterations, args.save_iterations)
File "E:\Projects\Python_projects\3D_Vsion\Deformable-3D-Gaussians\train.py", line 132, in training
dataset.load2gpu_on_the_fly, dataset.is_6dof)
File "E:\Projects\Python_projects\3D_Vsion\Deformable-3D-Gaussians\train.py", line 221, in training_report
images = torch.cat((images, image.unsqueeze(0)), dim=0)
RuntimeError: CUDA error: out of memory
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Training progress: 12%|█▎ | 5000/40000 [06:40<46:44, 12.48it/s, Loss=0.0467618]

RuiqingTang · 2024-04-08T05:06:28Z

The dataset is NeRF-DS.

ingra14m · 2024-04-09T06:13:45Z

Hi, thanks for your interest in this work.

In my experiments, NeRF-DS will not encounter oom on 3090. Can you provide the command leading to this problem?

RuiqingTang · 2024-04-09T10:24:59Z

The command I executed is as follows:
python train.py -s G:/dataset/3Dvision/real-world/NeRF-DS/cup -m output/exp-ds1 --eval

yangbaoquan · 2024-04-24T01:11:48Z

Hi, thanks for your interest in this work.

In my experiments, NeRF-DS will not encounter oom on 3090. Can you provide the command leading to this problem?

I have encountered the same issue on RTX4090.

RuiqingTang · 2024-04-26T07:06:00Z

@yangbaoquan, I uniformly sampled 52 images from the original dataset and fixed this error, but the reconstruction was poor.

yangbaoquan · 2024-04-26T08:26:55Z

@yangbaoquan, I uniformly sampled 52 images from the original dataset and fixed this error, but the reconstruction was poor.

@RuiqingTang As a temporary solution, I comment the following lines and run training code successfully.

# Log and save
# cur_psnr = training_report(tb_writer, iteration, Ll1, loss, l1_loss, iter_start.elapsed_time(iter_end),
#                            testing_iterations, scene, render, (pipe, background), deform,
#                            dataset.load2gpu_on_the_fly, dataset.is_6dof)
# if iteration in testing_iterations:
#     if cur_psnr.item() > best_psnr:
#         best_psnr = cur_psnr.item()
#         best_iteration = iteration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: CUDA error: out of memory #52

RuntimeError: CUDA error: out of memory #52

RuiqingTang commented Apr 8, 2024

RuiqingTang commented Apr 8, 2024

ingra14m commented Apr 9, 2024

RuiqingTang commented Apr 9, 2024

yangbaoquan commented Apr 24, 2024

RuiqingTang commented Apr 26, 2024

yangbaoquan commented Apr 26, 2024

RuntimeError: CUDA error: out of memory #52

RuntimeError: CUDA error: out of memory #52

Comments

RuiqingTang commented Apr 8, 2024

RuiqingTang commented Apr 8, 2024

ingra14m commented Apr 9, 2024

RuiqingTang commented Apr 9, 2024

yangbaoquan commented Apr 24, 2024

RuiqingTang commented Apr 26, 2024

yangbaoquan commented Apr 26, 2024