
lower GPU memory usage #10

Open
blackight opened this issue Jun 15, 2024 · 7 comments

@blackight

blackight commented Jun 15, 2024

It seems that the fp16 setting is not effective. I tried using fp16 manually and offloading the autoencoder and CLIP to CPU memory during DDIM denoising, and I can run a 45x512x768 video in 12 GB of GPU memory.

@wangxiang1230
Collaborator

> It seems that the fp16 setting is not effective. I tried using fp16 manually and offloading the autoencoder and CLIP to CPU memory during DDIM denoising, and I can run a 45x512x768 video in 12 GB of GPU memory.

Hi, thank you for your attention. You can merge your changes into our code, or post your improved code here to help others run the models with fewer resources.

@wangxiang1230
Collaborator

> It seems that the fp16 setting is not effective. I tried using fp16 manually and offloading the autoencoder and CLIP to CPU memory during DDIM denoising, and I can run a 45x512x768 video in 12 GB of GPU memory.

Hi, I offloaded the autoencoder and CLIP, and the GPU memory used is still ~22 GB. How do you reduce the memory further? That may be useful.

@blackight
Author

blackight commented Jun 15, 2024

    model = model.to(gpu)
    model.eval()
    model.to(torch.float16)  # add this line: cast the UNet to fp16

    # DDP removed: on my single-GPU PC it only increased memory usage
    # model = DistributedDataParallel(model, device_ids=[gpu]) if not cfg.debug else model

    ............

    # add these lines: offload the encoders to CPU and free cached memory before denoising
    clip_encoder.cpu()
    autoencoder.cpu()
    torch.cuda.empty_cache()

    video_data = diffusion.ddim_sample_loop(
        noise=noise_one,
        model=model.eval(),
        model_kwargs=model_kwargs_one,
        guide_scale=cfg.guide_scale,
        ddim_timesteps=cfg.ddim_timesteps,
        eta=0.0)

    # if the forward pass of autoencoder or clip_encoder is needed again, move them back
    clip_encoder.cuda()
    autoencoder.cuda()

My code looks like this; both fp16 in the UNet and CPU offloading are needed.
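
For reference, the same offload/reload pattern can be wrapped in a small context manager so it is harder to forget the reload step. This is a minimal sketch; the `offloaded` helper is not part of the repo, and `clip_encoder`/`autoencoder` stand for any `nn.Module`:

    import torch
    from contextlib import contextmanager

    @contextmanager
    def offloaded(*modules):
        # move the given modules to CPU and free cached GPU memory
        for m in modules:
            m.cpu()
        torch.cuda.empty_cache()
        try:
            yield
        finally:
            # restore the modules to the GPU for any later forward passes
            for m in modules:
                m.cuda()

    # usage around the sampling call above:
    # with offloaded(clip_encoder, autoencoder):
    #     video_data = diffusion.ddim_sample_loop(...)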

@wangxiang1230
Collaborator

Good, thanks for your contribution. I will add it to our code.

@zephirusgit

I haven't been able to get it to run well. With an RTX 2060 with 12 GB of VRAM, I see that it uses 21 GB of shared memory; it starts, but it sat at 0% for a while, so I stopped it. From what I see, it is designed for some kind of GPU cluster, and I had to modify it so that it does not ask for that, since there is no NCCL on Windows. (I don't have any more GPUs either.)

@wangxiang1230
Collaborator

wangxiang1230 commented Jun 16, 2024

> I haven't been able to get it to run well. With an RTX 2060 with 12 GB of VRAM, I see that it uses 21 GB of shared memory; it starts, but it sat at 0% for a while, so I stopped it. From what I see, it is designed for some kind of GPU cluster, and I had to modify it so that it does not ask for that, since there is no NCCL on Windows. (I don't have any more GPUs either.)

Hi, thank you for your attention. We noticed your problem, but since we don't have a Windows machine, we couldn't help modify the code. You can try changing max_frames to 16 or 24. We also welcome your suggestions and hope you will improve the code; we will incorporate the improved code into ours so that more people (researchers using different systems) can run the program. Thank you.
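
A minimal sketch of that change, assuming `cfg` is the same inference config object used in the snippet above (the exact field location may differ in the actual config files):

    cfg.max_frames = 16  # or 24; fewer frames per clip lowers peak GPU memory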

@blackight
Author

> I haven't been able to get it to run well. With an RTX 2060 with 12 GB of VRAM, I see that it uses 21 GB of shared memory; it starts, but it sat at 0% for a while, so I stopped it. From what I see, it is designed for some kind of GPU cluster, and I had to modify it so that it does not ask for that, since there is no NCCL on Windows. (I don't have any more GPUs either.)

You can delete the DistributedDataParallel wrapper in the code to avoid requiring a GPU cluster, or change the "nccl" backend to "gloo"; refer to https://pytorch.org/docs/stable/distributed.html.
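
A minimal sketch of the backend switch (the init_method address and world size are illustrative for a single local process; match them to the repo's actual setup code):

    import torch.distributed as dist

    # "gloo" runs on Windows; "nccl" requires NVIDIA GPUs on Linux
    dist.init_process_group(backend="gloo",
                            init_method="tcp://127.0.0.1:29500",
                            rank=0, world_size=1)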
