[TGA] Abstract TGA candidate ranking and fix ranking for BART #3455
Conversation
parlai/agents/bart/bart.py
Outdated
cands, _ = self._pad_tensor(batch.candidate_vecs[i])
scores, _ = self.model.decode_forced(enc, cands)
# ignore the score for the start token
scores = scores[:, 1:, :]
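The slice above can be illustrated with made-up shapes (a sketch only, assuming `decode_forced` returns scores shaped `(num_cands, seq_len, vocab_size)`; the real sizes depend on the batch):

```python
import torch

# Hypothetical shapes for illustration: 4 candidates, 10 decoded
# positions, vocab of 50. BART prepends a start token, so position 0
# scores that token and is dropped before ranking candidates.
scores = torch.randn(4, 10, 50)
scores = scores[:, 1:, :]  # shape becomes (4, 9, 50)
```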
is this the only line that is different? I'm wondering now if we can solve all of these issues by just overriding BartModel.output to do this computation... that would allow us to get rid of the duplicate compute_loss and _construct_token_losses functions as well
the other change is that it needs reshaping...
it looks like you just took the reshape out of the cross_entropy call? i.e. that's already in the base call
not quite -- in the original function it has a call to scores.view(num_cands * cands.size(1), -1), but this breaks with BART unless we call reshape. i think it would probably be fine to change the view call to reshape instead in TGA, but this is a slightly more expensive operation (copy instead of view). what do you think?
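The view-vs-reshape distinction being discussed can be reproduced on any non-contiguous tensor (a standalone PyTorch sketch, not ParlAI code):

```python
import torch

x = torch.arange(6).reshape(2, 3)
y = x.t()  # transposing makes the tensor non-contiguous

# y.view(6) raises a RuntimeError here, because view requires
# contiguous memory; reshape falls back to a copy when it must.
z = y.reshape(6)
```

This is why the BART path breaks under `.view` but works with `.reshape`, at the cost of a potential copy.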
since people don't seem to use this functionality too often i'll just change it to reshape
yeah given that this is rarely, if ever, used, it'll be better to just make them both reshape
🚀 awesome!
2 things:
- nit: a couple of lint errors
- Let's wait for BART tests to pass prior to merging. did you happen to run them locally?
ran bart locally but looks like distillbart is failing, looking into it
@EricMichaelSmith can you take a look at the changes to distillbart when you get a chance?
It's nice to see the cleanup of BART. Did you verify an older checkpoint didn't have results change?
parlai/core/torch_generator_agent.py
Outdated
@@ -864,6 +864,37 @@ def _add_generation_metrics(self, batch, preds):
    """
    pass

def _rank_eval_label_candidates(self, batch, batchsize):
Maybe this should be a public method?
Seems reasonable, minor comment
@@ -338,6 +341,9 @@ def _perform_forward_passes(self, batch: Batch) -> ForwardPassOutputs:
    mask = self._manipulate_mask(
nit: maybe mask should now be score_mask to differentiate it from decoder_mask?
yep! tried training for a bit on convai2 and comparing results
Patch description
Abstract Torch Generator Agent candidate ranking ability to a helper function.
Then, override this helper function in BART so that we may delete the score for the start token.
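The override pattern described above can be sketched as follows (hypothetical, simplified names and pure-Python score lists; the real helper operates on batched tensors):

```python
class GeneratorAgent:
    """Stand-in for the abstracted TGA ranking helper (simplified)."""

    def rank_candidates(self, scores):
        # rank candidate indices by summed per-token scores, best first
        return sorted(range(len(scores)), key=lambda i: -sum(scores[i]))


class BartAgent(GeneratorAgent):
    def rank_candidates(self, scores):
        # BART prepends a start token; drop its score before ranking
        return super().rank_candidates([s[1:] for s in scores])
```

For example, with scores `[[10, 1, 1], [0, 5, 5]]` the base agent ranks candidate 0 first (sum 12 vs 10), while the BART override ignores the first position and ranks candidate 1 first (2 vs 10).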
Testing steps
for BART:
for non-BART: