
Include padding mask in generation #2096

Merged: 1 commit into pytorch:main on Mar 7, 2023

Conversation

joecummings (Contributor):

Bug

Batched input is expected to produce the same output as the corresponding single input, e.g.

  1. [seq1, ... seq_m] -> generate -> [output1, ...., output_m]
  2. [seq1] -> generate -> [output1]

Before this change, these two calls would not produce the same output1. The issue was that the src_key_padding_mask was not being propagated forward.

Fix

Create the padding mask, add it to model_kwargs, and pass it to the forward function.
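
A minimal sketch of the idea, assuming the encoder input is padded with pad_idx; the helper name and the model_kwargs key below are illustrative, not necessarily the exact names used in the PR:

import torch

def make_padding_mask(input_ids: torch.Tensor, pad_idx: int) -> torch.Tensor:
    # True wherever a position holds the pad token; shape (batch_size, src_len).
    return input_ids.eq(pad_idx)

# Two sequences, the second padded to the batch length with pad_idx = 0.
encoder_input = torch.tensor([[5, 6, 7], [8, 9, 0]])
model_kwargs = {"encoder_padding_mask": make_padding_mask(encoder_input, pad_idx=0)}
# greedy_search can then forward the mask on every decoding step (e.g. as the
# model's src_key_padding_mask) so batched and single-input generation agree.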

@@ -48,7 +48,7 @@ def _prepare_decoder_ids_for_generation(
         return torch.ones((batch_size, 1), dtype=torch.long, device=device) * pad_idx

     def greedy_search(
-        self, input_ids: torch.Tensor, max_length: int, eos_idx: int, pad_idx: Optional[int] = None, **model_kwargs
+        self, input_ids: torch.Tensor, max_length: int, eos_idx: int, pad_idx: int, **model_kwargs
Contributor:

Does changing pad_idx from Optional to required break any call sites?

joecummings (Author):

Nope. It's only being called from the entry-point method at the moment.


         # Append the next tokens to the previous tokens
-        input_ids = torch.cat([input_ids, next_tokens], dim=-1)
+        input_ids = torch.cat([input_ids, next_tokens[:, None]], dim=-1)
Contributor:

What does the [:, None] do here?

joecummings (Author):

Same thing as unsqueezing the last dim, i.e. next_tokens.unsqueeze(-1).
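
A quick illustration of the equivalence, assuming next_tokens is the usual 1-D tensor of shape (batch_size,) produced by the per-step argmax:

import torch

next_tokens = torch.tensor([4, 9, 2])                                 # shape (batch_size,)
assert torch.equal(next_tokens[:, None], next_tokens.unsqueeze(-1))   # both are (batch_size, 1)

# Either form lets torch.cat append one new column per decoding step:
input_ids = torch.tensor([[1, 7], [1, 3], [1, 5]])                    # shape (batch_size, seq_len)
input_ids = torch.cat([input_ids, next_tokens[:, None]], dim=-1)      # shape (batch_size, seq_len + 1)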

# Generating for a single example should match the corresponding entry of the batched output.
tokens_for_single_example = generation_model.generate(inputs, num_beams=1, max_length=30)
generated_text_for_single_example = self.transform.decode(tokens_for_single_example.tolist())

self.assertEqual(generated_text[0], generated_text_for_single_example[-1])
Contributor:

Why do we do generated_text_for_single_example[-1] instead of generated_text_for_single_example[0]?

joecummings (Author):

I was originally going to pass multiple sequences through the second pass, but ended up not doing so. Both indexings give the same result, though: with a single decoded example, -1 == 0.
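
To make the indexing point concrete (the decoded string below is made up, just to show why -1 and 0 coincide when the second pass contains a single example):

decoded = ["only one generated sentence"]  # decoding a single-example batch yields a length-1 list
assert decoded[-1] == decoded[0]           # -1 and 0 both select the only element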

Nayef211 (Contributor) left a comment:

LGTM

joecummings merged commit db26565 into pytorch:main on Mar 7, 2023.
joecummings deleted the fix-diff-generation-batch branch on March 7, 2023 at 19:53.