Support list/tuple inputs for special tokens in MultiSegmentPacker layer #1046

abheesht17 · 2023-05-17T00:17:21Z

No description provided.

mattdangerw

looks great! just minor comments

mattdangerw · 2023-05-17T03:58:52Z

keras_nlp/layers/multi_segment_packer.py

@@ -53,12 +53,16 @@ class MultiSegmentPacker(keras.layers.Layer):

    Args:
        sequence_length: The desired output length.


let's add the type notes you have in the other layer for consistency

mattdangerw · 2023-05-17T03:58:59Z

keras_nlp/layers/multi_segment_packer.py

            dtype of the input tensors to the layer.
+        end_value: The id(s) or token(s) that is/are to be placed at the end of


is/are -> are

mattdangerw · 2023-05-17T03:59:09Z

keras_nlp/layers/multi_segment_packer.py

            dtype of the input tensors to the layer.
+        end_value: The id(s) or token(s) that is/are to be placed at the end of
+            the last input segment (called "[SEP]" for BERT). The dtype much


much -> must

mattdangerw · 2023-05-17T03:59:52Z

keras_nlp/layers/multi_segment_packer.py

+        sep_value: The id(s) or token(s) that is/are to be placed at the end of
+            every segment, except the last segment (called "[SEP]" for BERT).
+            If `None`, `end_value` is used. The dtype much match the dtype of
+            the input tensors to the layer.
        pad_value: The id or token that is to be placed into the unused
            positions after the last segment in the sequence
            (called "[PAD]" for BERT).


should we add an example below? maybe roberta double sep?

mattdangerw · 2023-05-17T04:01:33Z

keras_nlp/layers/multi_segment_packer.py

-        start_column = tf.fill((batch_size, 1), start_value)
-        end_column = tf.fill((batch_size, 1), end_value)
-        ones_column = tf.ones_like(start_column, dtype=tf.int32)
+        start_values_tensor = tf.repeat(


these names are a little confusing start_value is already a tensor. should we co back to _column naming?

start_column, end_column, sep_column?

We can, but it isn't exactly a column :P. I'll call it start_columns

mattdangerw · 2023-05-17T04:01:58Z

keras_nlp/layers/multi_segment_packer_test.py

+            (
+                [
+                    [
+                        "[CLS]",


try to come up with a slightly shorter test case that will format the lists to one line

mattdangerw

Very nice! Thank you.

mattdangerw · 2023-05-17T17:13:25Z

/gcbrun

Support list/tuple inputs for special tokens in MultiSegmentPacker layer

27df622

abheesht17 requested a review from mattdangerw May 17, 2023 03:44

mattdangerw requested changes May 17, 2023

View reviewed changes

Address comments

5853210

abheesht17 requested a review from mattdangerw May 17, 2023 10:34

NIT

2ad67df

mattdangerw approved these changes May 17, 2023

View reviewed changes

mattdangerw merged commit add02fe into keras-team:master May 17, 2023

mattdangerw mentioned this pull request Oct 18, 2023

Make Changes to MultiSegmentPacker Layer for RoBERTa #368

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support list/tuple inputs for special tokens in MultiSegmentPacker layer #1046

Support list/tuple inputs for special tokens in MultiSegmentPacker layer #1046

abheesht17 commented May 17, 2023

mattdangerw left a comment

mattdangerw May 17, 2023

mattdangerw May 17, 2023

mattdangerw May 17, 2023

mattdangerw May 17, 2023

mattdangerw May 17, 2023

abheesht17 May 17, 2023

mattdangerw May 17, 2023

abheesht17 May 17, 2023

mattdangerw left a comment

mattdangerw commented May 17, 2023

		@@ -53,12 +53,16 @@ class MultiSegmentPacker(keras.layers.Layer):

		Args:
		sequence_length: The desired output length.

		dtype of the input tensors to the layer.
		end_value: The id(s) or token(s) that is/are to be placed at the end of

Support list/tuple inputs for special tokens in MultiSegmentPacker layer #1046

Support list/tuple inputs for special tokens in MultiSegmentPacker layer #1046

Conversation

abheesht17 commented May 17, 2023

mattdangerw left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattdangerw left a comment

Choose a reason for hiding this comment

mattdangerw commented May 17, 2023