
Problem about cuda-out-of-memory #3

Open
Chelsea-abab opened this issue Oct 5, 2023 · 3 comments

Comments

@Chelsea-abab

Hi,
I want to reproduce your results via your provided codes. But I was stuck in the fine-tuning section. No matter how I reduce the batch size and input image size, it still says cuda out of memory. I run the codes by your instructions on a nvidia3090 gpu with 24g memory. But it seems that before the program loads the images, all the memory has already been allocated. So although I reduce the batch size to 1 and input size to 64, the cuda out of memory problem is still there. Does the memory be allocated to the models? Do you have any idea on how to solve this problem? I guess 24g is enough to tune a diffusion model.

@yu-rp
Owner

yu-rp commented Oct 8, 2023

Hi @Chelsea-abab ,

Reducing GPU memory consumption is a fairly general problem. Here are some ideas to achieve that:

  1. Use Mixed Precision Training: This involves using half-precision floating point (FP16) instead of single precision (FP32). PyTorch's native AMP (`torch.cuda.amp`) and libraries such as NVIDIA's Apex make this easier.

  2. Gradient Accumulation: Instead of updating weights every batch, accumulate gradients over multiple batches and then make a single update. This effectively simulates a larger batch size without the memory requirements.

  3. Use Gradient Checkpointing: Trade computation for memory by re-computing intermediate activations during the backward pass.
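The three techniques above can be combined in a single training loop. Here is a minimal PyTorch sketch (the toy model, synthetic data, and `accum_steps` value are made up for illustration; this is not code from this repository):

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint

# Toy model and synthetic data, stand-ins for the real fine-tuning setup.
model = nn.Sequential(nn.Linear(8, 32), nn.ReLU(), nn.Linear(32, 1))
opt = torch.optim.SGD(model.parameters(), lr=1e-2)

use_cuda = torch.cuda.is_available()
device = "cuda" if use_cuda else "cpu"
model.to(device)

# 1. Mixed precision: autocast + GradScaler (FP16 only kicks in on GPU).
scaler = torch.cuda.amp.GradScaler(enabled=use_cuda)

accum_steps = 4  # 2. simulate a batch 4x larger than what fits in memory
data = [(torch.randn(4, 8, device=device), torch.randn(4, 1, device=device))
        for _ in range(8)]

for step, (x, y) in enumerate(data):
    with torch.autocast(device_type=device, enabled=use_cuda):
        # 3. Gradient checkpointing: activations are dropped in the forward
        # pass and recomputed during backward, trading compute for memory.
        out = checkpoint(model, x, use_reentrant=False)
        loss = nn.functional.mse_loss(out, y) / accum_steps  # scale for accumulation
    scaler.scale(loss).backward()
    if (step + 1) % accum_steps == 0:
        scaler.step(opt)       # one optimizer update per accum_steps micro-batches
        scaler.update()
        opt.zero_grad(set_to_none=True)  # frees gradient memory between updates
```

On a 24 GB card you would typically combine these with a small per-GPU batch size; gradient checkpointing alone can cut activation memory substantially at the cost of roughly one extra forward pass per step.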

Or, for fine-tuning the diffusion model, you could also look for other repositories with more memory-efficient implementations.

Hope these ideas help you address the problem.

@Chelsea-abab
Author

OK, thanks for your reply! I thought there were some hyperparameters that could be adjusted to reduce GPU memory consumption. I will try these general methods to address the problem. Thanks again!

@jxthyatt

@Chelsea-abab did you solve this problem?
