
Limit torch_agent.Batch to tensors #3389

Merged (33 commits) on Mar 16, 2021

Conversation

@stephenroller (Contributor) commented on Jan 16, 2021

Patch description
Where possible, limit the attributes of our Batch object to tensors only. Additionally, delay cudafying until late, and switch many operations to use batch.batchsize.

This is in preparation for a fresh attempt at Background Preprocessing.
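As a rough sketch of the contract this moves toward (a hypothetical stand-in class, not the real `Batch`):

```python
import torch

# Hypothetical stand-in for parlai.core.torch_agent.Batch, illustrating
# the tensor-only contract this patch moves toward (not the real class).
class Batch(dict):
    def __init__(self, text_vec=None, label_vec=None, **kwargs):
        super().__init__(text_vec=text_vec, label_vec=label_vec, **kwargs)
        self.__dict__ = self
        # batchsize is computed once, so callers stop writing len(batch.text_vec)
        self.batchsize = 0 if text_vec is None else text_vec.size(0)

batch = Batch(text_vec=torch.zeros(4, 7, dtype=torch.long))
assert batch.batchsize == 4
# cudafying is delayed, e.g. to just before the forward pass:
# batch.text_vec = batch.text_vec.to('cuda')
```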

Testing steps
CI

@stephenroller marked this pull request as ready for review on February 26, 2021 at 13:50
@klshuster (Contributor) left a comment:

sweet

| `ctpb` | Context tokens per batch |
| `ctps` | Context tokens per second |
| `ctrun` | Fraction of samples with some context truncation |
Contributor:

nit: can we have it be `ctrunc`? `ctrun` reads to me like "ct run", which is a bit confusing

| `loss` | Loss |
| `lr` | The most recent learning rate applied |
| `ltpb` | Label tokens per batch |
| `ltps` | Label tokens per second |
| `ltrun` | Fraction of samples with some label truncation |
Contributor:

ditto. totally fine to leave it if we want to keep abbreviations to 5 chars or fewer

padded_context_vec,
torch.tensor(hist_lens, dtype=torch.long, device=self.device),
# sum here is list concat, not addition
context_vec, hist_lens_ = self._pad_tensor(
Contributor:

perhaps unrelated, but `_pad_tensor` returning the lengths as simply the lengths of the input lists is not super intuitive, especially when the pad token is an optional argument; ideally you'd want it to return the lengths of the unpadded input lists

is this why you recompute hist_lens below?

stephenroller (Author):

idk bro I'm just trying to get that one to work lol

Contributor:

my bad, I didn't realize this was in the hred agent
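For reference, the semantics under discussion, as a self-contained sketch (not ParlAI's actual `_pad_tensor`): the helper pads on CPU and returns the lengths of the unpadded inputs.

```python
import torch
from typing import List, Tuple

def pad_tensor_sketch(
    items: List[List[int]], pad_idx: int = 0
) -> Tuple[torch.LongTensor, List[int]]:
    # Record lengths before padding, so callers get the true item sizes.
    lens = [len(item) for item in items]
    out = torch.full((len(items), max(lens)), pad_idx, dtype=torch.long)
    for i, item in enumerate(items):
        out[i, : lens[i]] = torch.tensor(item, dtype=torch.long)
    return out, lens

padded, lens = pad_tensor_sketch([[1, 2, 3], [4]])
# padded -> [[1, 2, 3], [4, 0, 0]]; lens -> [3, 1]
```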

@@ -114,18 +114,16 @@ def train_step(self, batch):
"""
Return confirmation of training.
"""
return Output(['Training {}!'.format(i) for i in range(len(batch.text_vec))])
return Output(['Training {}!'.format(i) for i in range(batch.batchsize)])
Contributor:

nit: can use an f-string here
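That is, something like (the diff line above, rewritten):

```python
return Output([f'Training {i}!' for i in range(batch.batchsize)])
```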

i, batch.observations[i]['text']
)
for i in range(len(batch.text_vec))
'Evaluating {} (responding to {})!'.format(i, batch.text_vec.tolist())
Contributor:

ditto

@@ -123,8 +113,12 @@ def __init__(
valid_indices=None,
candidates=None,
candidate_vecs=None,
reward=None,
Contributor:

where is this reward coming from?

stephenroller (Author):

Some teachers provide it now and we batchify it. It's used by Unlikelihood and others.

Contributor:

could you include it in the Batch docstring?
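Something along these lines could work in the docstring (wording is just a suggestion, following the existing `:param:` style):

```
:param reward:
    optional tensor of per-example rewards, provided by some teachers and
    consumed by agents such as Unlikelihood.
```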

"""
Move all tensors in the batch to a device.

Happens in place. Note that valid_indices and fields starting with an underscore are not moved.
Contributor:

i like these semantics. can we make that more clear in the Batch object description? specifically underscored fields being exempt
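A minimal sketch of those semantics (hypothetical helper name, not the PR's exact code):

```python
import torch

def cudafy_in_place(batch: dict, device: str) -> None:
    # Move tensor fields to the device in place, but leave valid_indices
    # and any field whose name starts with an underscore untouched.
    for key, value in batch.items():
        if key == 'valid_indices' or key.startswith('_'):
            continue
        if torch.is_tensor(value):
            batch[key] = value.to(device)
```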

_context_original_length: Optional[torch.LongTensor]
_context_truncate_rate: Optional[torch.LongTensor]
_label_original_length: Optional[torch.LongTensor]
_label_truncate_rate: Optional[torch.LongTensor]

def __init__(
Contributor:

perhaps we can warn here if we're passing in a non-tensor?

stephenroller (Author):

For specifically these 4 or others? We have non-tensors (bools, Nones, etc).

Contributor:

wait so... we're not fully getting rid of non-tensors here?

stephenroller (Author):

Nope. I benchmarked them and they're not painful. Only the complex objects are.

Contributor:

how complex is complex? does this mean I can just make my own batch object and keep strings in there?

stephenroller (Author):

Ya I mean, we don't have any hard limitation. You can even do full observations if you want, you'll just pay a penalty with background workers. But any Batch object you manually want to add things to is still allowed.

I'll leave notes in batch descriptions
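If the warning floated above ever gets added, a minimal sketch might look like this (hypothetical helper, with the cheap types guessed from this thread):

```python
import warnings
import torch

# Scalars, bools, strings, and None are cheap; complex objects (lists of
# dicts, full observations, ...) are what hurt with background workers.
_CHEAP_TYPES = (int, float, bool, str, type(None))

def warn_on_complex_fields(**fields) -> None:
    for key, value in fields.items():
        if torch.is_tensor(value) or isinstance(value, _CHEAP_TYPES):
            continue
        warnings.warn(
            f'Batch field {key!r} is a {type(value).__name__}; complex '
            'non-tensor fields are slow with background preprocessing.'
        )
```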

@@ -65,11 +65,9 @@ def atomic_save(state_dict: Any, path: str) -> None:
def padded_tensor(
items: List[Union[List[int], torch.LongTensor]],
pad_idx: int = 0,
use_cuda: bool = False,
Contributor:

this will almost certainly break some internal stuff but i suppose those can be dealt with individually
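For those internal call sites, the migration implied by this diff is roughly as follows (import path as I recall it from this PR; treat as a sketch):

```python
from parlai.utils.torch import padded_tensor

items = [[1, 2, 3], [4]]

# before: padded_tensor could cudafy for you
#   padded, lens = padded_tensor(items, pad_idx=0, use_cuda=True)

# after: pad on CPU, then move explicitly only when needed
padded, lens = padded_tensor(items, pad_idx=0)
padded = padded.to('cuda')
```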

@EricMichaelSmith (Contributor) left a comment:

Seems reasonable - great to have this streamlining!

@@ -96,22 +88,20 @@ class Batch(AttrDict):

:param image:
list of image features in the format specified by the --image-mode arg.

:param observations:
the original observations in the batched order
"""
Contributor:

Nit: we might not need to define the new underscored args if we don't think the user should mess with them, but maybe we should add a sentence explaining generally what they are?

Additional resolved review threads on parlai/core/torch_agent.py (two threads) and projects/style_gen/classifier.py.