
Parallelize detensorizing token IDs during tree search #3730

Merged 1 commit into master from tok-id-speedup on Jun 18, 2021

Conversation

@EricMichaelSmith (Contributor) commented on Jun 16, 2021

Patch description
Currently, during any kind of tree search, the token ID tensor will be detensorized one item at a time, which is expensive. This PR rewrites that to detensorize the whole batch at once.
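For illustration, a minimal sketch of the general pattern (not the actual ParlAI diff; the tensor name and shapes here are assumptions):

```python
import torch

# Hypothetical batch of token IDs produced during tree search:
# (batch size, sequence length). Names and shapes are illustrative.
token_ids = torch.randint(0, 100, (32, 20))

# Before: detensorize one item at a time -- one .item() call
# (and, on GPU, one host/device sync) per token.
slow = [
    [token_ids[i, j].item() for j in range(token_ids.size(1))]
    for i in range(token_ids.size(0))
]

# After: detensorize the whole batch with a single .tolist() call.
fast = token_ids.tolist()

assert slow == fast
```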

Performance on 1000 generations (`-mf zoo:blender/blender_90M/model -t blended_skill_talk -ne 1000`), on an otherwise unoccupied devfair:

  • Original code, trial 1: median elapsed time 783 ms, mean 797 ms
  • Original code, trial 2: median 770 ms, mean 785 ms
  • This PR, trial 1: median 660 ms, mean 671 ms
  • This PR, trial 2: median 656 ms, mean 667 ms

That is, roughly a 15% reduction in both median and mean elapsed time.

Testing steps
CI checks

@stephenroller (Contributor) left a comment:


so good.

@stephenroller (Contributor)

Do we need to port this change to TorchScript?

@EricMichaelSmith (Contributor, Author) commented on Jun 17, 2021

> Do we need to port this change to TorchScript?

@stephenroller Hmm, doesn't look like it, because TorchScript export currently only supports greedy search.

@EricMichaelSmith merged commit d02d864 into master on Jun 18, 2021
@EricMichaelSmith deleted the tok-id-speedup branch on Jun 18, 2021 at 20:28