Delete `state_dict` to release memory as early as possible #18832

ydshieh · 2022-08-31T12:24:04Z

What does this PR do?

Note that this is not a real memory issue. A call to gc.collect() at the end of from_pretrained() works well too.

However this PR finds and simply del state_dict at the end of _load_state_dict_into_model(), and GC is able to perform housekeeping on its own at a earlier time.

HuggingFaceDocBuilderDev · 2022-08-31T12:35:37Z

The documentation is not available anymore as the PR was closed or merged.

ydshieh · 2022-08-31T12:45:34Z

The change regarding having a new argument state_dict in the nested function load is to pass black check, otherwise we get

src/transformers/modeling_utils.py:422:17: F821 undefined name 'state_dict'

with the new line del state_dict. (It's quite strange though)

ydshieh · 2022-08-31T14:29:58Z

Ready for review.

ydshieh · 2022-08-31T16:32:10Z

The failing test is test_encodings_from_xnli_dataset which is irrelevant to this PR.

sgugger

Thanks for fixing!

src/transformers/modeling_utils.py

LysandreJik

Looks good to me, thanks @ydshieh!

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

…ce#18832) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ydshieh requested a review from LysandreJik August 31, 2022 12:24

ydshieh requested a review from sgugger August 31, 2022 12:46

ydshieh mentioned this pull request Aug 31, 2022

Memory increment and release when loading model via PretrainedModel.from_pretrained #18782

Closed

ydshieh marked this pull request as draft August 31, 2022 12:50

ydshieh added 3 commits August 31, 2022 16:28

del a copy of state_dict to release memory as early as possible

50f7efd

A change in order to pass black check

52f2388

fix missing arg

45db5e6

ydshieh force-pushed the delete_state_dict_to_release_mem branch from b4e52bd to 45db5e6 Compare August 31, 2022 14:29

ydshieh marked this pull request as ready for review August 31, 2022 14:29

sgugger approved these changes Aug 31, 2022

View reviewed changes

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

LysandreJik approved these changes Sep 1, 2022

View reviewed changes

Update src/transformers/modeling_utils.py

3dcb2b7

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ydshieh merged commit 563a8d5 into main Sep 1, 2022

ydshieh deleted the delete_state_dict_to_release_mem branch September 1, 2022 08:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Delete `state_dict` to release memory as early as possible #18832

Delete `state_dict` to release memory as early as possible #18832

ydshieh commented Aug 31, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 31, 2022 •

edited

Loading

ydshieh commented Aug 31, 2022 •

edited

Loading

ydshieh commented Aug 31, 2022

ydshieh commented Aug 31, 2022

sgugger left a comment

LysandreJik left a comment

Delete state_dict to release memory as early as possible #18832

Delete state_dict to release memory as early as possible #18832

Conversation

ydshieh commented Aug 31, 2022 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Aug 31, 2022 • edited Loading

ydshieh commented Aug 31, 2022 • edited Loading

ydshieh commented Aug 31, 2022

ydshieh commented Aug 31, 2022

sgugger left a comment

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

Delete `state_dict` to release memory as early as possible #18832

Delete `state_dict` to release memory as early as possible #18832

ydshieh commented Aug 31, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 31, 2022 •

edited

Loading

ydshieh commented Aug 31, 2022 •

edited

Loading