@juyongjiang Thank you for this great work!
How can I fine-tune the model using less memory?
I'm hitting a CUDA OOM error while trying to fine-tune on Google Colab Pro with a T4 (15 GB)...
Thanks!

@mellahysf Hi, essentially we use low-rank adaptation (LoRA) to fine-tune the LLMs, which makes it feasible on a single GPU with limited memory. If only 15 GB of memory is available, I suggest reducing the rank parameter (`--lora_r=4`), though this may decrease model performance, and setting a smaller batch size (`--batch_size=64`). Have a try. : )
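For anyone curious why a smaller `--lora_r` saves memory: LoRA trains only two low-rank factors per adapted weight matrix instead of the full matrix, so the trainable-parameter (and optimizer-state) footprint scales with the rank `r`. Here is a quick back-of-the-envelope sketch; the 4096 dimension is illustrative (a LLaMA-7B-like projection), not this repo's exact config:

```python
# LoRA replaces the update to a d_out x d_in weight W with two small
# factors: B (d_out x r) and A (r x d_in). Only A and B are trained,
# so trainable params grow linearly in the rank r.

def lora_trainable_params(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters for one LoRA-adapted weight matrix."""
    return r * d_in + d_out * r

d = 4096                  # illustrative hidden size of the adapted projection
full = d * d              # params if we fine-tuned the full matrix instead
for r in (16, 8, 4):      # candidate --lora_r values; 4 is the low-memory pick
    lora = lora_trainable_params(d, d, r)
    print(f"r={r}: {lora} trainable params ({lora / full:.4%} of full)")
```

With `r=4` each adapted 4096x4096 matrix trains only 32,768 parameters (about 0.2% of the 16.8M in the full matrix), which is why halving or quartering the rank is the first knob to turn when a 15 GB GPU runs out of memory, ahead of shrinking the batch size.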