
Add scores for generated text in inference mode #164

Open · wants to merge 5 commits into master
Conversation

@allen-q allen-q commented Aug 20, 2020

Background:

I was using the T5 model and wanted to get the scores for the generated text in inference mode. However, T5 does not currently support this, and I was advised to implement the feature and open a pull request. Please see google-research/text-to-text-transfer-transformer#311 for more details.

This PR implements that feature: when a model is exported in SavedModel format, the scores (log likelihoods) are added to the outputs alongside the generated text.

Changed file:
./mesh/mesh_tensorflow/transformer/utils.py
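For context, the "scores" being discussed here are sequence log-likelihoods: the sum over decoding steps of the log-probability the model assigned to each generated token. A minimal NumPy sketch of that computation (function name and shapes are illustrative, not the Mesh TensorFlow API):

```python
import numpy as np

def sequence_scores(logits, token_ids):
    """Sum of per-step log-probabilities for each generated sequence.

    logits:    float array of shape (batch, length, vocab) -- decoder logits.
    token_ids: int array of shape (batch, length) -- the generated tokens.
    Returns a (batch,) array of sequence log-likelihoods.
    """
    # Numerically stable log-softmax over the vocabulary axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Pick out the log-probability of each generated token...
    batch, length = token_ids.shape
    picked = log_probs[np.arange(batch)[:, None],
                       np.arange(length)[None, :],
                       token_ids]
    # ...and sum over decoding steps to get one score per sequence.
    return picked.sum(axis=-1)
```

With uniform logits over a vocabulary of size V and a sequence of length T, each score comes out to T * log(1/V), which is a quick sanity check for an implementation like this.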

SignatureDef Diff
Below is the SignatureDef of a T5 MTF SavedModel before the change:

The given SavedModel SignatureDef contains the following input(s):
  inputs['input'] tensor_info:
      dtype: DT_STRING
      shape: (-1)
      name: inputs:0
The given SavedModel SignatureDef contains the following output(s):
  outputs['inputs'] tensor_info:
      dtype: DT_STRING
      shape: (10)
      name: SentenceTokenizer/SentenceTokenizer/SentencepieceDetokenizeOp:0
  outputs['outputs'] tensor_info:
      dtype: DT_STRING
      shape: (10)
      name: SentenceTokenizer_1/SentenceTokenizer/SentencepieceDetokenizeOp:0
Method name is: tensorflow/serving/predict

Below is the SignatureDef of a T5 MTF SavedModel after the change:

The given SavedModel SignatureDef contains the following input(s):
  inputs['input'] tensor_info:
      dtype: DT_STRING
      shape: (-1)
      name: inputs:0
The given SavedModel SignatureDef contains the following output(s):
  outputs['inputs'] tensor_info:
      dtype: DT_STRING
      shape: (10)
      name: SentenceTokenizer/SentenceTokenizer/SentencepieceDetokenizeOp:0
  outputs['outputs'] tensor_info:
      dtype: DT_STRING
      shape: (10)
      name: SentenceTokenizer_1/SentenceTokenizer/SentencepieceDetokenizeOp:0
  outputs['scores'] tensor_info:
      dtype: DT_FLOAT
      shape: (10)
      name: reshape_17/parallel_0/Reshape:0
Method name is: tensorflow/serving/predict
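The listings above are in the format printed by TensorFlow's `saved_model_cli`; assuming the export lives at `./export/1598900000` (path illustrative), they can be reproduced with:

```shell
saved_model_cli show \
  --dir ./export/1598900000 \
  --tag_set serve \
  --signature_def serving_default
```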

@googlebot

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google.
In order to pass this check, please resolve this problem and then comment @googlebot I fixed it.. If the bot doesn't comment, it means it doesn't think anything has changed.


@allen-q force-pushed the Add_probability_in_inference_mode branch from a3d42a3 to 7a96222 on August 31, 2020
@googlebot

CLAs look good, thanks!


…nsistent with other scores. create a compute_score function to remove duplicate code.
@allen-q allen-q changed the title Add probabilities for generated text in inference model Add scores for generated text in inference model Aug 31, 2020
@allen-q allen-q changed the title Add scores for generated text in inference model Add scores for generated text in inference mode Aug 31, 2020
Member

@adarob adarob left a comment


Actually, this is going to result in an approximate doubling of inference time. Can you make it so the score is computed in sample_autoregressive?
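The suggestion here is to accumulate the score during sampling rather than re-running the model over the generated sequence afterwards (which is what roughly doubles inference time). A toy illustration of the idea — the model and loop below are stand-ins, not Mesh TensorFlow's `sample_autoregressive`:

```python
import numpy as np

def sample_with_scores(step_fn, batch, length, rng):
    """Sampling loop that accumulates log-probs as it decodes.

    step_fn(prefix) -> (batch, vocab) logits for the next token.
    Returns (tokens, scores), where scores[i] is the log-likelihood of
    row i, collected during decoding -- no second forward pass needed.
    """
    tokens = np.zeros((batch, 0), dtype=np.int64)
    scores = np.zeros(batch)
    for _ in range(length):
        logits = step_fn(tokens)
        # Numerically stable log-softmax over the vocabulary.
        shifted = logits - logits.max(axis=-1, keepdims=True)
        log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
        probs = np.exp(log_probs)
        # Sample one token per row; add its log-prob to the running score.
        next_tok = np.array([rng.choice(probs.shape[-1], p=p) for p in probs])
        scores += log_probs[np.arange(batch), next_tok]
        tokens = np.concatenate([tokens, next_tok[:, None]], axis=1)
    return tokens, scores
```

Since the logits at each step are already computed for sampling, the extra cost of the score is just a gather and a running sum per step.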

@adarob
Member

adarob commented Oct 2, 2020

@allen-q do you plan on following up with this? thanks!

@allen-q
Author

allen-q commented Oct 6, 2020 via email

@adarob
Member

adarob commented Oct 16, 2020

No worries. Perhaps we can just gate this with a bool arg for now until we have the "free" version?

@marton-avrios

marton-avrios commented Dec 10, 2020

I am currently working on a return_logits option for sample_autoregressive that returns the already-available logits together with the outputs, so no extra computation is involved. If it is set to True, the function returns an (outputs, output_logits) tuple instead of just outputs. But I think it no longer makes sense to return only outputs, so I am not sure another argument should be introduced.

5 participants