
Lack of memory on the GPU during training #16

Open · dwz138831 opened this issue Mar 31, 2024 · 3 comments

@dwz138831

Hi, I'm reproducing with the latest version of the code on 8 A30 GPUs (24 GB of memory each), and it shows CUDA out of memory at the beginning of the first epoch. What is the cause of this?

@LinShan-Bin
Owner

To reproduce the full results, you need 80 GB of memory. The baseline method (no semantics, `contracted_coord = False`, and `auxiliary_frame = False`) can be reproduced with 24 GB of memory.
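
For reference, a minimal sketch of those baseline overrides as they might appear in a Python config. The two flags are named in this thread, but the semantics flag and the config layout below are assumptions; check the repo's config files for the exact option names.

```python
# Baseline settings that fit in 24 GB, per the reply above.
use_semantic = False        # hypothetical flag name: train without the semantic head
contracted_coord = False    # disable the contracted (unbounded) outer coordinates
auxiliary_frame = False     # disable auxiliary-frame supervision
```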

@ZiyangYan commented Apr 16, 2024

> To reproduce the full results, you need 80 GB of memory. The baseline method (no semantics, `contracted_coord = False`, and `auxiliary_frame = False`) can be reproduced with 24 GB of memory.

Hi, I changed the config as you mentioned and ran it on 4 × 24 GB A30 GPUs, but it still has the OOM problem.

@LinShan-Bin
Owner

> To reproduce the full results, you need 80 GB of memory. The baseline method (no semantics, `contracted_coord = False`, and `auxiliary_frame = False`) can be reproduced with 24 GB of memory.

> Hi, I changed the config as you mentioned and ran it on 4 × 24 GB A30 GPUs, but it still has the OOM problem.

For `contracted_coord = False`, we recommend using a voxel size of [16, 200, 200]. This keeps the resolution of the inner part and removes the contracted outer part. Although we used [24, 300, 300] in our experiments for fair comparison, you will have to cut down the voxel size to save memory.
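
As an illustration, the corresponding config change might look like the sketch below. This is a sketch only: `voxel_size` is an assumed option name, and the grid shapes are the ones quoted in this comment.

```python
# Memory-saving grid for the non-contracted baseline.
contracted_coord = False
# The experiments in the paper used [24, 300, 300]; without the contracted
# outer region, [16, 200, 200] keeps the inner resolution and saves memory.
voxel_size = [16, 200, 200]
```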
