Added an Example for BLEU. #806

Neeshamraghav012 · 2023-03-07T04:56:01Z

I would like to inform you that I have made some updates to the content. Firstly, I have added an example for BLEU which was not present before. This new example serves as an illustration of the concept and should be helpful for those who are new to this topic.

Furthermore, I have also fixed the URLs by wrapping them in markdown. This should make it easier for readers to access the resources mentioned in the content.

I would like to inform you that I have made some updates to the content. Firstly, I have added an example for BLEU which was not present before. This new example serves as an illustration of the concept and should be helpful for those who are new to this topic. Furthermore, I have also fixed the URLs by wrapping them in markdown. This should make it easier for readers to access the resources mentioned in the content.

Neeshamraghav012 · 2023-03-07T04:57:41Z

@abheesht17 I have deleted the previous pull request because I messed up with it. Please review this one.

abheesht17

Left a few comments. Most of them our minor, but there is one major comment regarding rank of inputs!

abheesht17 · 2023-03-08T03:16:28Z

keras_nlp/metrics/bleu.py

@@ -4,7 +4,7 @@
 # you may not use this file except in compliance with the License.
 # You may obtain a copy of the License at
 #
-# https://www.apache.org/licenses/LICENSE-2.0
+#  https://www.apache.org/licenses/LICENSE-2.0


Please remove extra spaces.

abheesht17 · 2023-03-08T03:19:11Z

keras_nlp/metrics/bleu.py

+ >>> bleu = keras_nlp.metrics.Bleu(max_order=4)
+ >>> ref_sentence = "the quick brown fox jumps over the lazy dog"
+ >>> pred_sentence = "the quick brown fox jumps over the box"
+ >>> score = bleu([ref_sentence], [pred_sentence])


Since we mention that this example is with Python string inputs, this should ideally be

bleu([ref_sentence], pred_sentence)

abheesht17 · 2023-03-08T03:19:55Z

keras_nlp/metrics/bleu.py

+ "the quick brown fox jumps over the lazy frog"
+ ]
+ >>> pred_sentence = ["the quick brown fox jumps over the box"]
+ >>> score = bleu(ref_sentence, pred_sentence)


Instead of assigning the score to a variable, let's just have

>>> bleu(ref_sentence, pred_sentence) <tf.Tensor(0.7420885, shape=(), dtype=float32)>

Please do the same in other places as well.

Although the above example works, the ideal input formula is:

rank(ref_sentence) = rank(pred_sentence) + 1

I think we should probably stick to this formula in our examples. Otherwise, it becomes confusing for the reader. Could you please make this change everywhere?

Although the above example works, the ideal input formula is:

rank(ref_sentence) = rank(pred_sentence) + 1

I think we should probably stick to this formula in our examples. Otherwise, it becomes confusing for the reader. Could you please make this change everywhere?

Correct me if I am wrong, for this, do I need to change blue(ref_sentence, pred_sentence) to bleu([ref_sentence], pred_sentence)?

abheesht17 · 2023-03-08T03:36:52Z

keras_nlp/metrics/bleu.py

+ c. RaggedTensor.
+ >>> bleu = keras_nlp.metrics.Bleu(max_order=4)
+ >>> ref_sentence = tf.ragged.constant([
+ [
+ "the quick brown fox jumps over the lazy dog",
+ "the quick brown fox jumps over the lazy frog"
+ ]
+ ])
+ >>> pred_sentence = tf.ragged.constant([
+ ["the quick brown fox jumps over the box"]
+ ])
+ >>> score = bleu(ref_sentence, pred_sentence)
+ <tf.Tensor(0.7420885, shape=(), dtype=float32)>


Since we mention that we are using ragged tensors here, it would be great if we could actually have a tensor with variable dimension along an axis. Also, I think your example will throw an error; pred_sentence should be a dense tensor here, not a ragged tensor.

So, something like:

>>> bleu = keras_nlp.metrics.Bleu(max_order=4) >>> ref_sentence = tf.ragged.constant([ ... ["the quick brown fox jumps over the lazy dog"], ... ["the quick brown fox jumps over the lazy dog", "the quick brown fox jumps over the lazy frog"] ... ]) >>> pred_sentence = tf.constant([ ... ["the quick brown fox jumps over the box"], ... ["the quick brown fox jumps over the box"] ... ]) >>> bleu(ref_sentence, pred_sentence) <tf.Tensor: shape=(), dtype=float32, numpy=0.7420885>

abheesht17 · 2023-03-08T03:40:16Z

keras_nlp/metrics/bleu.py

+ >>> score = bleu([ref_sentence], [pred_sentence])
+ <tf.Tensor(0.7420885, shape=(), dtype=float32)>
+
+ 1.2. rank 1 inputs.


We should change these to

rank 1 prediction inputs, rank 2 reference inputs

or something like that?

I checked the Rouge L metric documentation, where this same convention is used. Is it okay to make these changes?

abheesht17 · 2023-03-08T03:43:24Z

@mattdangerw - for context, this is the previous PR: #799

Minor improvements are done!

Neeshamraghav012 · 2023-03-27T17:32:22Z

Hi @abheesht17 and @mattdangerw, I made the minor changes as you suggested. Thank you for your feedback!

However, I am not sure about the part where you mentioned changing the rank of inputs. Could you please provide more details or clarify what you meant by that?

abheesht17 suggested changes Mar 8, 2023

View reviewed changes

abheesht17 requested review from abheesht17 and mattdangerw and removed request for abheesht17 March 8, 2023 03:42

mattdangerw assigned abheesht17 and mattdangerw Mar 22, 2023

Update bleu.py

b5df504

Minor improvements are done!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added an Example for BLEU. #806

Added an Example for BLEU. #806

Neeshamraghav012 commented Mar 7, 2023

Neeshamraghav012 commented Mar 7, 2023

abheesht17 left a comment

abheesht17 Mar 8, 2023

abheesht17 Mar 8, 2023

abheesht17 Mar 8, 2023

abheesht17 Mar 8, 2023

Neeshamraghav012 Mar 10, 2023 •

edited

Loading

abheesht17 Mar 8, 2023

abheesht17 Mar 8, 2023

Neeshamraghav012 Mar 10, 2023

abheesht17 commented Mar 8, 2023

Neeshamraghav012 commented Mar 27, 2023

Added an Example for BLEU. #806

Are you sure you want to change the base?

Added an Example for BLEU. #806

Conversation

Neeshamraghav012 commented Mar 7, 2023

Neeshamraghav012 commented Mar 7, 2023

abheesht17 left a comment

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

Neeshamraghav012 Mar 10, 2023 • edited Loading

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

abheesht17 Mar 8, 2023

Choose a reason for hiding this comment

Neeshamraghav012 Mar 10, 2023

Choose a reason for hiding this comment

abheesht17 commented Mar 8, 2023

Neeshamraghav012 commented Mar 27, 2023

Neeshamraghav012 Mar 10, 2023 •

edited

Loading