Parallelize detensorizing context over batch #3729

EricMichaelSmith · 2021-06-16T21:24:18Z

Patch description
Currently, when generating, we convert the context Tensor to a list one example at a time, which is expensive; this PR parallelizes this operation over the entire batch.

Performance on 1000 generations (-mf zoo:blender/blender_90M/model -t blended_skill_talk -ne 1000), on an otherwise unoccupied devfair:

Original code, trial 1: median elapsed time 783 ms, mean 797 ms
Original code, trial 2: median 770 ms, mean 785 ms
This PR, trial 1: median 651 ms, mean 663 ms
This PR, trial 2: median 665 ms, mean 677 ms

Testing steps
CI checks (which seem to cover context blocking)

stephenroller

This one seems great to me... Haven't reviewed the other. We can talk more offline.

stephenroller · 2021-06-16T23:01:26Z

parlai/core/torch_generator_agent.py

+        """
+        if self.beam_context_block_ngram <= 0:
+            # We aren't context blocking, return empty tensor of the correct size
+            return torch.LongTensor([[]] * batch.batchsize)


Suggested change

return torch.LongTensor([[]] * batch.batchsize)

return torch.zeros(batch.batchsize, 0)

Yeah, that's cleaner - just added that, but with dtype=torch.long

EricMichaelSmith added 2 commits June 16, 2021 16:19

Batch context.tolist()

954cfe5

Cleanup

63a3031

EricMichaelSmith requested review from stephenroller and emilydinan June 16, 2021 21:24

facebook-github-bot added the CLA Signed label Jun 16, 2021

stephenroller approved these changes Jun 16, 2021

View reviewed changes

Suggestion

897535a

EricMichaelSmith merged commit e853e7c into master Jun 18, 2021

EricMichaelSmith deleted the context-blocking-speedups branch June 18, 2021 20:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelize detensorizing context over batch #3729

Parallelize detensorizing context over batch #3729

EricMichaelSmith commented Jun 16, 2021 •

edited

Loading

stephenroller left a comment

stephenroller Jun 16, 2021

EricMichaelSmith Jun 17, 2021

	return torch.LongTensor([[]] * batch.batchsize)
	return torch.zeros(batch.batchsize, 0)

Parallelize detensorizing context over batch #3729

Parallelize detensorizing context over batch #3729

Conversation

EricMichaelSmith commented Jun 16, 2021 • edited Loading

stephenroller left a comment

Choose a reason for hiding this comment

stephenroller Jun 16, 2021

Choose a reason for hiding this comment

EricMichaelSmith Jun 17, 2021

Choose a reason for hiding this comment

EricMichaelSmith commented Jun 16, 2021 •

edited

Loading