
Assign image tensors to data_device immediately on creation. #667

Open · wants to merge 1 commit into base: main
Conversation

@GaneshBannur commented Feb 21, 2024

Tensors created from PIL images are first allocated on the CPU:

resized_image = torch.from_numpy(np.array(resized_image_PIL)) / 255.0

If data_device is "cuda", they are later moved to the GPU. Normally the now-unreferenced CPU tensors should be released, but PyTorch doesn't seem to free this memory. The result is high CPU RAM consumption for the entire training duration, even when data_device is "cuda".

Moving the tensors to data_device immediately on creation dramatically decreases CPU RAM consumption when data_device is "cuda". When training on a T4 instance on Colab with 200 images, CPU RAM consumption went from 10GB down to 2GB. GPU VRAM consumption doesn't increase, since the tensors are eventually moved to the GPU anyway.

It might help to move all tensors to data_device immediately on creation since PyTorch doesn't seem to deallocate RAM for CPU tensors.
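A minimal sketch of the pattern this PR describes, using a synthetic NumPy array in place of a PIL image (the `image_to_tensor` helper is hypothetical, not code from the repository):

```python
import numpy as np
import torch

def image_to_tensor(image_array: np.ndarray, data_device: str) -> torch.Tensor:
    # Before the change, the tensor lives on the CPU first and is moved later,
    # leaving a CPU copy that PyTorch does not appear to release:
    #   t = torch.from_numpy(image_array) / 255.0   # CPU allocation
    #   t = t.to(data_device)                       # moved afterwards
    # After the change, the tensor is placed on data_device immediately:
    return torch.from_numpy(image_array).to(data_device) / 255.0

# Fall back to CPU when no GPU is available, so the sketch runs anywhere.
device = "cuda" if torch.cuda.is_available() else "cpu"
img = np.zeros((4, 4, 3), dtype=np.uint8)  # stand-in for np.array(resized_image_PIL)
t = image_to_tensor(img, device)
```

With `data_device="cuda"`, the only lasting allocation is on the GPU; the intermediate CPU tensor from `from_numpy` is transient rather than held for the whole training run.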

nnmhuy added a commit to nnmhuy/gaussian-splatting that referenced this pull request Sep 24, 2024