VisionEncoderDecoderModel gradient checkpointing #18513

metemadi · 2022-08-07T17:23:50Z

Feature request

Would love to be able to use gradient checkpointing on VisionEncoderDecoder model.

model.gradient_checkpointing_enable()
Traceback (most recent call last):
File "", line 1, in
File "/opt/conda/lib/python3.8/site-packages/transformers/modeling_utils.py", line 1418, in gradient_checkpointing_enable
raise ValueError(f"{self.class.name} does not support gradient checkpointing.")
ValueError: VisionEncoderDecoderModel does not support gradient checkpointing.

Motivation

Gradient checkpointing always helps increase the accessibility of larger models - HuggingFace is awesome!!!

Your contribution

Happy to take a stab at this if someone can point me to a previous example of this working with an EncoderDecoder model.

LysandreJik · 2022-08-09T08:17:21Z

@NielsRogge, have you seen such examples? :)

NielsRogge · 2022-08-09T11:17:08Z

Here's a PR that added gradient checkpointing to T5: https://github.com/huggingface/transformers/pull/11353/files

NielsRogge · 2022-08-26T12:13:46Z

Fixed per #18697

NielsRogge closed this as completed Aug 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VisionEncoderDecoderModel gradient checkpointing #18513

VisionEncoderDecoderModel gradient checkpointing #18513

metemadi commented Aug 7, 2022

LysandreJik commented Aug 9, 2022

NielsRogge commented Aug 9, 2022

NielsRogge commented Aug 26, 2022

VisionEncoderDecoderModel gradient checkpointing #18513

VisionEncoderDecoderModel gradient checkpointing #18513

Comments

metemadi commented Aug 7, 2022

Feature request

Motivation

Your contribution

LysandreJik commented Aug 9, 2022

NielsRogge commented Aug 9, 2022

NielsRogge commented Aug 26, 2022