
Fix decoder_input_ids for bare T5Model and improve doc #18791

Merged

8 commits merged into huggingface:main on Sep 6, 2022

Conversation

ekagra-ranjan
Contributor

@ekagra-ranjan ekagra-ranjan commented Aug 28, 2022

What does this PR do?

  • Fix 1: in docs/source/en/model_doc/t5.mdx, use the tokenizer to obtain the labels as tensors directly.
  • Fix 2: src/transformers/models/t5/
    • Present case: T5 prepends the decoder_input_ids with the pad token. This preprocessing is handled internally by T5ForConditionalGeneration, which shifts the labels to the right.
    • Issue: this preprocessing must be done manually when using the bare T5Model, but it is missing from the example that uses bare T5Model.
    • Proposed fix: added a preprocessing step to the example so that the input matches what T5 expects at its decoder. The PR reuses the _shift_right() method, which is an internal function of T5. Please let me know if we can rename _shift_right() to shift_right() or if there is a better way to handle this.
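For illustration, here is a minimal pure-Python sketch of what T5's internal _shift_right() does to the labels (the real method operates on tensors inside the model; the standalone function name shift_right below is hypothetical, and the token ids are made up):

```python
def shift_right(label_ids, decoder_start_token_id=0, pad_token_id=0):
    """Sketch of T5's _shift_right(): prepend the decoder start token
    (for T5 this is the pad token, id 0), drop the last position, and
    replace any -100 ignored-label sentinels with the pad token."""
    shifted = [[decoder_start_token_id] + row[:-1] for row in label_ids]
    return [[pad_token_id if tok == -100 else tok for tok in row]
            for row in shifted]

# Labels as produced by the tokenizer (ids illustrative, 1 = </s>):
labels = [[71, 307, 1]]
decoder_input_ids = shift_right(labels)
print(decoder_input_ids)  # [[0, 71, 307]]
```

With T5ForConditionalGeneration this shift happens automatically when you pass labels; the point of the PR is that the bare T5Model example must perform it explicitly before passing decoder_input_ids.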

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

@sgugger @patrickvonplaten @patil-suraj

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Aug 28, 2022

The documentation is not available anymore as the PR was closed or merged.

Collaborator

@sgugger sgugger left a comment


Thanks for your PR! @patrickvonplaten is better suited to review it as he knows T5 better than I do :-)

... padding="longest",
... max_length=max_target_length,
... truncation=True,
... return_tensors="pt",
Contributor


Thanks!

@patrickvonplaten
Contributor

Thanks for the fixes!

ekagra-ranjan and others added 3 commits September 5, 2022 20:34
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
@ekagra-ranjan
Contributor Author

@patrickvonplaten Thanks for the review! Applied your suggestions.

@patrickvonplaten patrickvonplaten merged commit f85acb4 into huggingface:main Sep 6, 2022
oneraghavan pushed a commit to oneraghavan/transformers that referenced this pull request Sep 26, 2022
* use tokenizer to output tensor

* add preprocessing for decoder_input_ids for bare T5Model

* add preprocessing to tf and flax

* linting

* linting

* Update src/transformers/models/t5/modeling_flax_t5.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/models/t5/modeling_tf_t5.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/models/t5/modeling_t5.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>