FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU #29317

younesbelkada · 2024-02-27T09:26:21Z

What does this PR do?

In TRL we do have a class that call generate with MaxLengthCriteria and running the CI on main seems to cause some issues after #29116 😱
Not sure how to repro apart from running the failing CI and I did not flagged anything alarming outside my CI but to be on the safe zone I propose to init the torch.full tensor in bool directly, I assume it's fine according to the type hint

Here is the traceback:

self = [<transformers.generation.stopping_criteria.MaxLengthCriteria object at 0x7f652ccf8810>]
input_ids = tensor([[31373,   995, 19277],
        [31373,   995, 27455],
        [31373,   995,  4562],
        [31373,   995, 26964]])
scores = None, kwargs = {}, is_done = tensor([False, False, False, False])
criteria = <transformers.generation.stopping_criteria.MaxLengthCriteria object at 0x7f652ccf8810>

    @add_start_docstrings(STOPPING_CRITERIA_INPUTS_DOCSTRING)
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs) -> torch.BoolTensor:
        is_done = torch.full((input_ids.shape[0],), False, device=input_ids.device)
        for criteria in self:
>           is_done = is_done | criteria(input_ids, scores, **kwargs)
E           RuntimeError: "bitwise_or_cpu" not implemented for 'Float'

https://github.com/huggingface/trl/actions/runs/8058401333/job/22011132917

younesbelkada · 2024-02-27T09:26:55Z

cc @zucchini-nlp @gante what do you think about these changes ? 🙏 I can also spend some time on trying to repro with a minial snippet but if these changes look good to you perhaps we can move forward with it

PS: and for some reason I can't request review @zucchini-nlp 🤯

HuggingFaceDocBuilderDev · 2024-02-27T09:45:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2024-02-27T14:19:38Z

Okay for me, but it's weird that max_length criteria was returning a float tensor

younesbelkada · 2024-02-27T14:27:52Z

@zucchini-nlp torch.full(xxx) returns a float tensor by default unless you don't explicitly cast it or init the tensor with the desired dtype. The thing I am not sure about is that why it didn't get flagged on other tests

gante

Perfect, thank you for having a look at the issue 🤗

…a on CPU (huggingface#29317) fix the bitwise or issue

…a on CPU (#29317) fix the bitwise or issue

fix the bitwise or issue

4b657f7

younesbelkada requested a review from gante February 27, 2024 09:26

Merge remote-tracking branch 'upstream/main' into fix-dtype-bool

99f8620

gante approved these changes Mar 4, 2024

View reviewed changes

younesbelkada requested a review from ArthurZucker March 4, 2024 09:56

gante requested review from ArthurZucker and removed request for ArthurZucker March 4, 2024 09:56

ArthurZucker approved these changes Mar 5, 2024

View reviewed changes

younesbelkada merged commit 81c8191 into huggingface:main Mar 5, 2024
21 checks passed

younesbelkada deleted the fix-dtype-bool branch March 5, 2024 01:29

damithsenanayake pushed a commit to damithsenanayake/transformers that referenced this pull request Mar 7, 2024

FIX [Generation] Fix some issues when running the MaxLength criteri…

67890e6

…a on CPU (huggingface#29317) fix the bitwise or issue

itazap pushed a commit that referenced this pull request May 14, 2024

FIX [Generation] Fix some issues when running the MaxLength criteri…

f60ac0c

…a on CPU (#29317) fix the bitwise or issue

zucchini-nlp mentioned this pull request Jun 4, 2024

Specify dtype=torch.bool to avoid xla error #31191

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU #29317

FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU #29317

younesbelkada commented Feb 27, 2024 •

edited

Loading

younesbelkada commented Feb 27, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Feb 27, 2024

zucchini-nlp commented Feb 27, 2024

younesbelkada commented Feb 27, 2024

gante left a comment

FIX [Generation] Fix some issues when running the MaxLength criteria on CPU #29317

FIX [Generation] Fix some issues when running the MaxLength criteria on CPU #29317

Conversation

younesbelkada commented Feb 27, 2024 • edited Loading

What does this PR do?

younesbelkada commented Feb 27, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Feb 27, 2024

zucchini-nlp commented Feb 27, 2024

younesbelkada commented Feb 27, 2024

gante left a comment

Choose a reason for hiding this comment

FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU #29317

FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU #29317

younesbelkada commented Feb 27, 2024 •

edited

Loading

younesbelkada commented Feb 27, 2024 •

edited

Loading