Skip to content

Latest commit

 

History

History
64 lines (56 loc) · 1.99 KB

Readme.md

File metadata and controls

64 lines (56 loc) · 1.99 KB

Sellam et al. (2020)

Publication

BLEURT: Learning Robust Metrics for Text Generation

Repositories

https://github.com/google-research/bleurt

Available Models

The BLEURT class can be instantiated with the checkpoints provided by the original repository. See here for the list. The corresponding model names are "bleurt-{tiny,base,large}-{128,512}" and should be passed to the constructor of the class.

  • BLEURT
    • Description: A learned evaluation metric for natural language generation
    • Name: sellam2020-bleurt
    • Usage:
      from repro.models.sellam2020 import BLEURT
      model = BLEURT(model="bleurt-base-128")
      inputs = [
          {"candidate": "The candidate text", "references": ["The reference"]}
      ]
      scores = model.predict_batch(inputs)

Implementation Notes

  • BLEURT only supports single references (the argument to references should be a list of length 1).

Docker Information

  • Image name: sellam2020
  • Build command:
    repro setup sellam2020 \
      [--not-tiny-128] \
      [--not-base-128] \
      [--tiny-512] \
      [--base-512] \
      [--large-128] \
      [--large-512] \
      [--silent]
    The arguments specify which BLEURT models should be downloaded. Both bleurt-tiny-128 and bleurt-base-128 are downloaded by default.
  • Requires network: No

Testing

Explain how to run the unittests for this model

repro setup sellam2020
pytest models/sellam2020/tests

Status

  • Regression unit tests pass
  • Correctness unit tests pass
    The unit tests are based on examples in the official repository. See here.
  • Model runs on full test dataset
    Not tested
  • Predictions approximately replicate results reported in the paper
    Not tested
  • Predictions exactly replicate results reported in the paper
    Not tested