[seq2seq Examples] Use _step instead of generate for val, test #7004

setu4993 · 2020-09-08T05:53:54Z

Addresses a workaround I proposed in #6589 for limiting the memory consumption in .generate steps.

patil-suraj · 2020-09-08T06:24:34Z

examples/seq2seq/finetune.py

-            num_beams=self.eval_beams,
-        )
+        loss_tensors, logits = self._step(batch)
+        if self.eval_beams == 0:


IMO it would be better to add an explicit argument like predict_with_generate to decide whether to use generate or not; this would also align it with #6769. Setting eval_beams to 0 will cause this assert to fail.

@patil-suraj : I updated that assertion above :).

Good point on creating an additional arg, though. I was erring on the side of fewer args, but this might be more useful as an explicit arg than a hidden one. I don't have a strong opinion either way.

Aah right, I missed the updated assertion.

Right, not super important. Just a nit
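
For readers following the thread, here is a rough sketch of what the explicit-flag option could look like. Only the names predict_with_generate and eval_beams come from the discussion above; build_parser, choose_eval_mode, and the defaults are hypothetical and are not taken from finetune.py.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical argument parser; not the one in finetune.py."""
    parser = argparse.ArgumentParser()
    # Flag name comes from the review comment; its wiring here is an assumption.
    parser.add_argument("--predict_with_generate", action="store_true")
    # Default of 1 is an assumption made for this sketch.
    parser.add_argument("--eval_beams", type=int, default=1)
    return parser


def choose_eval_mode(args: argparse.Namespace) -> str:
    """Return "generate" to decode with generate, or "step" to reuse _step."""
    if args.predict_with_generate:
        # Beam search needs at least one beam, so the check only applies here.
        assert args.eval_beams >= 1, "eval_beams must be >= 1 when generating"
        return "generate"
    # Without the flag, validation/test would only compute the loss via _step.
    return "step"
```

With a flag like this, eval_beams would no longer need to double as a hidden switch, so a value of 0 would not have to be special-cased in the assertion.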

setu4993 · 2020-09-16T19:42:45Z

@sshleifer : Does it make sense to still add this? If yes, I can rebase and update based on Suraj's comments earlier. If not, will close.

sshleifer · 2020-09-17T00:13:05Z

Let's hold off for now. I think passing --eval_num_beams=1 is close enough to equivalent.
Thanks for trying!

Use _step instead of generate for val, test

3e7294b

setu4993 mentioned this pull request Sep 8, 2020

[seq2seq] finetune.sh OOMs in fp16 w torch 1.6 on colab #6589

Closed

patil-suraj reviewed Sep 8, 2020

sshleifer closed this Sep 17, 2020

swethmandava mentioned this pull request Sep 22, 2020

❓ Difficulties to reproduce BART results on CNN/DM by fine-tuning bart-large #5654

Closed
