
[GenerationOutputs] Fix GenerationOutputs Tests #9443

Merged

Conversation

@patrickvonplaten (Contributor) commented Jan 6, 2021

What does this PR do?

The GenerationOutputs PR #9150 was not rebased, so CircleCI on master is now red. This PR fixes it.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@@ -522,6 +522,7 @@ def test_greedy_generate_dict_outputs_use_cache(self):
             return

         config.use_cache = True
+        config.is_decoder = True
@patrickvonplaten (Contributor, Author) commented Jan 6, 2021:

Make sure to use a causal mask for models like BERT, RoBERTa, ...
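
A minimal, illustrative sketch (not part of this PR) of what `config.is_decoder = True` changes for BERT-like models: with it set, the model applies a causal (left-to-right) attention mask and can return `past_key_values` for caching. Class names are from the transformers library; the tiny config values are arbitrary.

```python
# Illustrative only (not from this PR): a tiny BERT decoder config.
import torch
from transformers import BertConfig, BertLMHeadModel

config = BertConfig(
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
    is_decoder=True,   # apply a causal (left-to-right) attention mask
    use_cache=True,    # allow the model to return past_key_values
)
model = BertLMHeadModel(config)

input_ids = torch.tensor([[101, 7592, 2088, 102]])  # toy token ids
outputs = model(input_ids, use_cache=True)
# one (key, value) pair per layer, reusable on the next generation step
assert len(outputs.past_key_values) == config.num_hidden_layers
```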

@@ -455,7 +455,7 @@ def prepare_inputs_for_generation(
             "decoder_attention_mask": decoder_attention_mask,
             "decoder_input_ids": decoder_inputs["input_ids"],
             "encoder_outputs": encoder_outputs,
-            "past_key_values": past,
+            "past_key_values": decoder_inputs["past_key_values"],
@patrickvonplaten (Contributor, Author) commented:

@patil-suraj -> I think this is a bit safer.
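
For context, a hedged sketch of the shape of a seq2seq prepare_inputs_for_generation (not the repository code; the surrounding method body is assumed): taking "past_key_values" from the already-prepared decoder_inputs keeps the returned dict consistent with whatever the decoder-side preparation actually produced, instead of echoing back the raw past argument.

```python
# Hedged sketch, not the actual repository code: shape of a seq2seq
# prepare_inputs_for_generation. The decoder prepares its own inputs, and the
# cache is taken from that prepared dict rather than from the raw `past` arg.
def prepare_inputs_for_generation(
    self, input_ids, past=None, attention_mask=None, encoder_outputs=None, **kwargs
):
    decoder_inputs = self.decoder.prepare_inputs_for_generation(input_ids, past=past)
    return {
        "attention_mask": attention_mask,
        "decoder_attention_mask": decoder_inputs.get("attention_mask"),
        "decoder_input_ids": decoder_inputs["input_ids"],
        "encoder_outputs": encoder_outputs,
        "past_key_values": decoder_inputs["past_key_values"],  # the safer source
    }
```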

@@ -570,7 +570,7 @@ def prepare_inputs_for_generation(self, input_ids, past=None, attention_mask=Non
         if past is not None:
             input_ids = input_ids[:, -1:]

-        return {"input_ids": input_ids, "attention_mask": attention_mask}
+        return {"input_ids": input_ids, "attention_mask": attention_mask, "past_key_values": past}
@patrickvonplaten (Contributor, Author) commented:

@patil-suraj we forgot to add this in the BERT cache PR. BERT-like models can also be used as stand-alone BertForCausalLM models -> so we need to return past_key_values here.
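
A hedged usage sketch of why this matters (illustrative, not from the PR): when a BERT-like model runs stand-alone causal generation, generate() only feeds the cache back into the model if prepare_inputs_for_generation returns it under "past_key_values". Class names come from the transformers library; the tiny config is arbitrary.

```python
# Illustrative only: stand-alone causal generation with a BERT-like model.
import torch
from transformers import BertConfig, BertLMHeadModel

config = BertConfig(
    hidden_size=32, num_hidden_layers=2, num_attention_heads=2,
    intermediate_size=64, is_decoder=True,
)
model = BertLMHeadModel(config)

out = model.generate(
    torch.tensor([[101]]),         # toy prompt
    max_length=8,
    use_cache=True,                # only effective if prepare_inputs_for_generation
                                   # forwards "past_key_values" to the model call
    return_dict_in_generate=True,  # the GenerationOutputs return type from #9150
)
print(out.sequences.shape)         # (1, 8)
```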

@patrickvonplaten (Contributor, Author) commented Jan 6, 2021:

This PR actually made me fix two additional bugs:

  1. past_key_values was not passed through for BertForCausalLM.
  2. T5 should not return cross attentions when used as an encoder-only model -> make sure the encoder model never has config.is_decoder=True (see the config sketch below).
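
A hedged, config-level sketch of the second point (illustrative, not the PR's diff): inside a T5 encoder-decoder model, the encoder's copy of the shared config must never be flagged as a decoder, otherwise encoder outputs would wrongly carry cross-attention (and cache) entries.

```python
# Illustrative only: how an encoder/decoder split of a shared T5Config should look.
import copy
from transformers import T5Config

config = T5Config()                     # shared seq2seq config

encoder_config = copy.deepcopy(config)
encoder_config.is_decoder = False       # no causal mask, no cross attention
encoder_config.use_cache = False        # an encoder has nothing to cache

decoder_config = copy.deepcopy(config)
decoder_config.is_decoder = True        # causal mask + cross attention over encoder states
```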

@patrickvonplaten patrickvonplaten merged commit b8462b5 into huggingface:master Jan 6, 2021
@sgugger (Collaborator) left a comment:

Thanks for fixing!

@patrickvonplaten deleted the fix_output_generate branch January 6, 2021 20:24