stack_llama: add parameter to control max_length (to mitigate OOM errors) #359

teticio · 2023-05-11T13:17:27Z

As I "only" have 24 Gb VRAM, I run out of memory when training the reward model. The dataset was previously truancated to a hardcoded value of 512; I have made this a script argument.

HuggingFaceDocBuilderDev · 2023-05-11T13:21:11Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

lvwerra

Thanks, looks good to me!

younesbelkada

Awesome work! Thanks a lot for this!

add parameter to control max_length (to mitigate OOM errors)

4dc8498

lvwerra approved these changes May 11, 2023

View reviewed changes

lvwerra requested a review from younesbelkada May 11, 2023 13:22

younesbelkada approved these changes May 11, 2023

View reviewed changes

younesbelkada merged commit e0172fc into huggingface:main May 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stack_llama: add parameter to control max_length (to mitigate OOM errors) #359

stack_llama: add parameter to control max_length (to mitigate OOM errors) #359

teticio commented May 11, 2023

HuggingFaceDocBuilderDev commented May 11, 2023

lvwerra left a comment

younesbelkada left a comment

stack_llama: add parameter to control max_length (to mitigate OOM errors) #359

stack_llama: add parameter to control max_length (to mitigate OOM errors) #359

Conversation

teticio commented May 11, 2023

HuggingFaceDocBuilderDev commented May 11, 2023

lvwerra left a comment

Choose a reason for hiding this comment

younesbelkada left a comment

Choose a reason for hiding this comment