[gradient_checkpointing] default to use it for torch 2.3 #28538

Conversation
Makes sense!
Why do we use reentrant gc by default? The docs say the non-reentrant version can be more advantageous than the reentrant one: https://pytorch.org/docs/2.0/checkpoint.html#torch.utils.checkpoint.checkpoint
@hiyouga use_reentrant=True is used by default in PyTorch anyway, so if you set it to …
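For context, a minimal sketch of how the flag can be set explicitly rather than relying on PyTorch's default. It assumes a transformers version whose gradient_checkpointing_enable accepts gradient_checkpointing_kwargs (v4.35+); the model name is illustrative:

```python
import torch
from transformers import AutoModelForCausalLM

# Illustrative model; any architecture that supports gradient
# checkpointing works the same way.
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Opt in to gradient checkpointing and pass use_reentrant through
# to torch.utils.checkpoint.checkpoint explicitly, instead of
# relying on PyTorch's current default (use_reentrant=True).
model.gradient_checkpointing_enable(
    gradient_checkpointing_kwargs={"use_reentrant": False}
)
```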
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I upgraded transformers to the latest version but still get this warning, and it is logged at every single step.
How do I disable it?
Can you open a new issue with a proper reproducer?
What does this PR do?
Fixes #28536 in preparation for the next torch release.
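A rough sketch of the kind of change this implies (not the exact diff): binding use_reentrant explicitly when the checkpoint function is built, so behavior is pinned and the deprecation warning for an omitted flag goes away regardless of what default a future torch release picks:

```python
import functools
import torch.utils.checkpoint

# Sketch: pin use_reentrant explicitly. The kwargs dict may come
# from the user via gradient_checkpointing_kwargs; True is assumed
# here to preserve the current PyTorch default behavior.
gradient_checkpointing_kwargs = {"use_reentrant": True}
gradient_checkpointing_func = functools.partial(
    torch.utils.checkpoint.checkpoint, **gradient_checkpointing_kwargs
)

# Modules then call
#   gradient_checkpointing_func(layer_forward, hidden_states, ...)
# instead of calling torch.utils.checkpoint.checkpoint directly.
```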