Reduce memory consumption in batched_forward_pass #234

ohashi56225 · 2023-03-21T08:49:11Z

This PR reduces memory consumption in batched_forward_pass of PPOTrainer by not storing logits when they are not needed.

Before this PR, batched_forward_pass always stored the model's logits alongside the other tensors, such as values and logprobs. A logits tensor is much larger (batch_size × tokens × vocabulary_size) than a logprobs or values tensor (batch_size × tokens), so storing it consumes a significant amount of CUDA memory.
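To give a rough sense of the size gap, here is a back-of-the-envelope calculation. The batch size, sequence length, vocabulary size, and float32 element size are illustrative assumptions, not values taken from this PR.

```python
def tensor_bytes(shape, bytes_per_element=4):
    """Bytes needed for a dense tensor of the given shape (float32 by default)."""
    n = 1
    for dim in shape:
        n *= dim
    return n * bytes_per_element


# Illustrative values only (assumptions, not from the PR).
batch_size, tokens, vocab_size = 16, 512, 50257

logits_bytes = tensor_bytes((batch_size, tokens, vocab_size))
logprobs_bytes = tensor_bytes((batch_size, tokens))

print(f"logits:   {logits_bytes / 2**30:.2f} GiB")
print(f"logprobs: {logprobs_bytes / 2**10:.0f} KiB")
# The ratio between the two is exactly the vocabulary size.
print(f"ratio:    {logits_bytes // logprobs_bytes}x")
```

Whatever the concrete numbers, the logits tensor is larger than the logprobs or values tensor by a factor of vocabulary_size.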

I have modified batched_forward_pass so that it no longer stores logits unnecessarily; they are only required when calculating entropy in the loss method.
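A minimal sketch of the idea, not the actual diff: the function below keeps logits only when the caller asks for them. The flag name return_logits comes from the commit message in this PR; everything else (the function signature, the ToyModel class, the helper logprobs_from_logits, and the use of torch.no_grad) is a hypothetical stand-in chosen for illustration.

```python
import torch
import torch.nn.functional as F


def logprobs_from_logits(logits, labels):
    """Log-probability of each label token under the given logits."""
    logp = F.log_softmax(logits, dim=-1)
    return torch.gather(logp, 2, labels.unsqueeze(-1)).squeeze(-1)


def batched_forward_pass_sketch(model, input_ids, mini_batch_size, return_logits=False):
    """Hypothetical simplification: accumulate logits only if return_logits is True."""
    all_logprobs, all_values, all_logits = [], [], []
    for start in range(0, input_ids.shape[0], mini_batch_size):
        ids = input_ids[start:start + mini_batch_size]
        # Assumption: gradients are disabled here to keep the sketch simple.
        with torch.no_grad():
            logits, values = model(ids)
        # Logits at position t score the token at position t + 1.
        all_logprobs.append(logprobs_from_logits(logits[:, :-1, :], ids[:, 1:]))
        all_values.append(values[:, :-1])
        if return_logits:
            # Only now is the (batch, tokens, vocab) tensor kept alive.
            all_logits.append(logits[:, :-1, :])
    return (
        torch.cat(all_logprobs),
        torch.cat(all_values),
        torch.cat(all_logits) if return_logits else None,
    )


class ToyModel(torch.nn.Module):
    """Hypothetical stand-in for a language model with a value head."""

    def __init__(self, vocab_size=50, hidden=8):
        super().__init__()
        self.embed = torch.nn.Embedding(vocab_size, hidden)
        self.lm_head = torch.nn.Linear(hidden, vocab_size)
        self.v_head = torch.nn.Linear(hidden, 1)

    def forward(self, input_ids):
        h = self.embed(input_ids)
        return self.lm_head(h), self.v_head(h).squeeze(-1)
```

With return_logits left at False, the per-mini-batch logits go out of scope after each iteration, so only the small logprobs and values tensors accumulate. A caller that needs entropy would pass return_logits=True.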

HuggingFaceDocBuilderDev · 2023-03-21T09:04:09Z

The documentation is not available anymore as the PR was closed or merged.

younesbelkada

Thanks a lot for fixing this and for taking care of the memory consumption.
This looks very good to me!
Would love to hear @lvwerra's thoughts here.

trl/trainer/ppo_trainer.py

lvwerra

Looks great, thanks!

Reduce memory consumption by not storing logits in forward_pass

bd862fe

ohashi56225 changed the title ~~Reduce memory consumption by avoiding logits storage in forward_pass~~ Reduce memory consumption in batched_forward_pass Mar 21, 2023

younesbelkada approved these changes Mar 21, 2023

trl/trainer/ppo_trainer.py

Add docstring of return_logits

a3f9c9a

younesbelkada requested a review from lvwerra March 21, 2023 12:27

lvwerra approved these changes Mar 22, 2023

lvwerra merged commit a6ebdb6 into huggingface:main Mar 22, 2023
