
Custom beam search scorer argument in generate function #32097

Open
wants to merge 2 commits into base: main
Conversation


@GM07 GM07 commented Jul 19, 2024

What does this PR do?

Added a beam_search_scorer_class argument to the generate() function so that users can run beam search and grouped beam search with a custom beam search scorer.

Before this change, the internal methods _beam_search() and _group_beam_search() had to be called directly to pass in a custom beam search scorer. This caused a lot of problems, since the generate() function does a lot of preprocessing on the input (for example, interleaving the input_ids in the case of beam search), and all of that preprocessing had to be reproduced manually. Now the scorer can be set directly in the generate() function by passing its type, in the following way:

import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, BeamSearchScorer

tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-base")
model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-base")

encoder_input_str = "translate English to German: How old are you?"
encoder_input_ids = tokenizer(encoder_input_str, return_tensors="pt").input_ids

class CustomBeamSearchScorer(BeamSearchScorer):
    finalize_called = False
    process_called = False

    def __init__(self, test_args, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.test_args = test_args

    def process(self, *args, **kwargs):
        results = super().process(*args, **kwargs)
        # Do stuff
        return results

    def finalize(self, *args, **kwargs):
        results = super().finalize(*args, **kwargs)
        # Do stuff
        return results

# let's run beam search using 3 beams
num_beams = 3
# define decoder start token ids
input_ids = torch.ones((1, 1), device=model.device, dtype=torch.long)
input_ids = input_ids * model.config.decoder_start_token_id

# add encoder_outputs to model keyword arguments
model_kwargs = {"encoder_outputs": model.get_encoder()(encoder_input_ids, return_dict=True)}

outputs = model.generate(
    input_ids,
    num_beams=num_beams,
    min_length=5,
    eos_token_id=model.config.eos_token_id,
    beam_search_scorer_class=CustomBeamSearchScorer,
    beam_search_scorer_args={"test_args": True},
    **model_kwargs,
)

Why was it done this way?

Initially, beam_search_scorer was passed as an object rather than a type. However, that could lead to inconsistencies between the parameters of the generation config and those of the beam search scorer: for example, the number of beams could be set to 2 in the generate() call but to 4 when creating the scorer. Passing only the type allows the scorer to be created from the generation config inside the method (as before), preventing any inconsistency between the two objects.
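The class-plus-kwargs pattern described above can be sketched in plain Python, independent of transformers. All names here (BaseScorer, generate, scorer_args) are illustrative, not the actual library internals:

```python
# Illustrative sketch (not actual transformers code): the caller supplies a
# scorer *class* plus extra constructor kwargs, and the framework fills in the
# config-derived arguments, so num_beams can never diverge from the config.

class BaseScorer:
    def __init__(self, num_beams):
        self.num_beams = num_beams


class CustomScorer(BaseScorer):
    def __init__(self, test_args, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.test_args = test_args


def generate(scorer_class=BaseScorer, scorer_args=None, num_beams=1):
    # The framework, not the caller, supplies num_beams from its own config,
    # then merges in the user's extra constructor arguments.
    return scorer_class(num_beams=num_beams, **(scorer_args or {}))


scorer = generate(scorer_class=CustomScorer, scorer_args={"test_args": True}, num_beams=3)
assert scorer.num_beams == 3      # always taken from the framework's config
assert scorer.test_args is True   # user-supplied extra argument
```

Because the framework owns the num_beams keyword, an instance constructed with a conflicting value can never reach the decoding loop.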

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@gante
@ArthurZucker

@GM07 GM07 force-pushed the generate-with-custom-beam-search-scorer branch from 6c6e18a to d23a29a Compare July 19, 2024 18:43
@GM07 GM07 force-pushed the generate-with-custom-beam-search-scorer branch from d23a29a to 5437234 Compare July 22, 2024 18:27
@ArthurZucker
Collaborator

cc @zucchini-nlp and @gante

Member

@zucchini-nlp zucchini-nlp left a comment


Hi @GM07 !

We're currently refactoring generate and the beam methods, and would like not to introduce new changes until the refactoring is done. The refactoring tracker can be found at #30810.

@gante
Member

gante commented Jan 30, 2025

Hi @GM07 👋

Custom beam scoring will be enabled, but in a different format :) Beam search is being refactored first in #35802; after that, we can add much simpler scoring functions.

(This PR will not be accepted in its current form, but contributions for the new format are welcome 🤗 )

5 participants