Generate: validate `model_kwargs` on TF (and catch typos in generate arguments) #18651

gante · 2022-08-16T13:28:04Z

What does this PR do?

TF version of #18261

Adds `model_kwargs` validation to TF `generate`, which also catches typos in the arguments. See the PR above for more details and an example of the error message users will see.
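To illustrate why validating `model_kwargs` also catches typos, here is a minimal, self-contained sketch. It is not the transformers implementation: the `generate` function, the `accepted_model_inputs` set, and the error wording are all hypothetical stand-ins. The point is only that a misspelled generate argument silently falls into `**model_kwargs`, where a check against the inputs the model accepts can flag it.

```python
def generate(input_ids, max_length=20, do_sample=False, **model_kwargs):
    """Toy stand-in for a generate method (hypothetical, not the library code)."""
    # Hypothetical set of extra inputs that the toy model's forward pass accepts.
    accepted_model_inputs = {"attention_mask", "decoder_input_ids"}
    # Anything that is neither a named generate argument nor an accepted model
    # input ends up here, including misspelled generate arguments.
    unused = [
        key
        for key, value in model_kwargs.items()
        if value is not None and key not in accepted_model_inputs
    ]
    if unused:
        raise ValueError(f"The following `model_kwargs` are not used by the model: {unused}")
    # Dummy "generation": just truncate the input to max_length.
    return input_ids[:max_length]


message = None
try:
    generate([1, 2, 3], max_lenght=50)  # typo: should be max_length
except ValueError as err:
    message = str(err)
    print(message)
```

Without the check, the misspelled `max_lenght` would be ignored and the default `max_length` used instead, which is the kind of silent failure this validation is meant to surface.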

Since TF had no dedicated file for generate tests, I took the liberty of creating one and moving some existing tests there (>70% of the diff is due to moving things around :) ). The test for this new check was also added there.

HuggingFaceDocBuilderDev · 2022-08-16T13:49:02Z

The documentation is not available anymore as the PR was closed or merged.

gante · 2022-08-16T14:09:58Z

tests/generation/test_generation_tf_utils.py

+class UtilsFunctionsTest(unittest.TestCase):
+
+    # tests whether the top_k_top_p_filtering function behaves as expected
+    def test_top_k_top_p_filtering(self):


moved from test_modeling_tf_common, no changes

gante · 2022-08-16T14:10:20Z

tests/generation/test_generation_tf_utils.py

+class TFGenerationIntegrationTests(unittest.TestCase):
+
+    @slow
+    def test_generate_tf_function_export(self):


moved from test_modeling_tf_common, added the @slow decorator (takes >30s)

Cool, that makes sense (similar to PyTorch).
Also, just FYI: in PyTorch we're currently testing much more than in TF, mainly because we allow returning hidden_states and attentions. We could do the same for TF at some point.
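For readers unfamiliar with the `@slow` marker added to the export test above, the sketch below shows one way such a decorator can be built with the standard library. This is an assumption-laden illustration, not the decorator shipped with the test suite: the `RUN_SLOW` environment variable, the accepted truthy values, and the `ExampleTests` class are hypothetical here.

```python
import os
import unittest


def slow(test_case):
    """Skip the decorated test unless RUN_SLOW is set to a truthy value (assumed convention)."""
    run_slow = os.environ.get("RUN_SLOW", "").lower() in {"1", "true", "yes"}
    return unittest.skipUnless(run_slow, "test is slow")(test_case)


class ExampleTests(unittest.TestCase):
    @slow
    def test_expensive_export(self):
        # Placeholder body standing in for a test that takes a long time to run.
        self.assertEqual(sum(range(10)), 45)
```

With a decorator like this, expensive tests are reported as skipped in the default run and only execute when the flag is set explicitly.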

gante · 2022-08-16T14:11:21Z

src/transformers/generation_tf_utils.py

@@ -1288,6 +1290,29 @@ def adjust_logits_during_generation(
        else:
            return logits

+    def _validate_model_kwargs(self, model_kwargs: Dict[str, Any]):


Same as for PyTorch (here), with `self.forward` replaced by `self.call`
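A rough sketch of what signature-based validation against `call` can look like is given below. It is a simplified illustration under stated assumptions, not the method added in this PR: `ToyTFModel`, its `call` signature, and the error text are hypothetical, and the real check may consult more than the `call` signature.

```python
import inspect
from typing import Any, Dict


class ToyTFModel:
    """Hypothetical stand-in for a TF model; only the `call` signature matters here."""

    def call(self, input_ids=None, attention_mask=None, training=False):
        return input_ids

    def validate_model_kwargs(self, model_kwargs: Dict[str, Any]) -> None:
        # Names accepted by the forward pass (`call` on TF, `forward` on PyTorch).
        model_args = set(inspect.signature(self.call).parameters)
        # Collect every non-None kwarg that the forward pass would not accept.
        unused = [
            key
            for key, value in model_kwargs.items()
            if value is not None and key not in model_args
        ]
        if unused:
            raise ValueError(f"The following `model_kwargs` are not used by the model: {unused}")


model = ToyTFModel()
model.validate_model_kwargs({"attention_mask": [1, 1]})  # accepted, no error
```

Reading the accepted names from the method signature means the check follows each model's own inputs, so there is no separate allow-list to maintain per architecture.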

gante · 2022-08-16T14:12:24Z

tests/generation/test_generation_utils.py

@@ -2702,8 +2702,8 @@ def test_constrained_beam_search_mixin_type_checks(self):
            model.generate(input_ids, force_words_ids=[[[-1]]])

    def test_validate_generation_inputs(self):
-        tokenizer = AutoTokenizer.from_pretrained("patrickvonplaten/t5-tiny-random")
-        model = AutoModelForSeq2SeqLM.from_pretrained("patrickvonplaten/t5-tiny-random")
+        tokenizer = AutoTokenizer.from_pretrained("hf-internal-testing/tiny-random-t5")


The model is not relevant for the test, but not using a model from hf-internal-testing was an oversight in the previous PR :D

Yes, definitely a good idea!

patrickvonplaten · 2022-08-26T18:31:24Z

src/transformers/generation_tf_utils.py

@@ -1483,6 +1508,9 @@ def _generate(
        # generate sequences without allowing bad_words to be generated
        outputs = model.generate(input_ids=input_ids, max_length=100, do_sample=True, bad_words_ids=bad_words_ids)
        ```"""
+        # 0. Validate model kwargs


Haha, fine with me! Can also increase all numbers otherwise

patrickvonplaten

Thanks for the clean-up


gante requested review from patrickvonplaten and Rocketknight1 August 16, 2022 14:16

gante marked this pull request as ready for review August 16, 2022 14:17

gante changed the title ~~Generate: validate model_kwargs on TF (and catch typos in generate arguments)~~ Generate: validate model_kwargs on TF (and catch typos in generate arguments) Aug 16, 2022

patrickvonplaten reviewed Aug 26, 2022


patrickvonplaten approved these changes Aug 26, 2022


gante added 5 commits September 2, 2022 13:02

Add TF generate kwarg validation

5c521c9

derp

41414b9

create test_generation_tf_utils.py; Add kwarg check tests

82d085e

make fixup

c8a24c5

make fixup

3eb5072

gante force-pushed the tf_generate_kwarg_valid branch from 293120d to 3eb5072 Compare September 2, 2022 13:12

gante merged commit 9196f48 into huggingface:main Sep 2, 2022

gante deleted the tf_generate_kwarg_valid branch September 2, 2022 15:25

oneraghavan pushed a commit to oneraghavan/transformers that referenced this pull request Sep 26, 2022

Generate: validate model_kwargs on TF (and catch typos in generate arguments) (huggingface#18651)

fa86aa0
