Generate: consistently handle special tokens as tensors #30624
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
cc @zucchini-nlp this one is the same as #29788, which you've already reviewed in its early state. Note that because we've merged other PRs first (e.g. removing the decoding functions from the public API), the diff is much smaller 💛
Overall good. I don't think we should warn but rather error out, and maybe update the serialize/deserialize?
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
@ArthurZucker addressed your comments :D (let's see this fast CI going brr)
Looks good, thanks for updating.
```diff
  torch.zeros(input_ids.shape[:2], dtype=torch.int64, layout=input_ids.layout, device=input_ids.device)
- + model._get_decoder_start_token_id()
+ + generation_config.decoder_start_token_id
```
do we have to do the `+` and not just use the decoder start token id?
I think the `+` here broadcasts, producing a tensor where every element is `decoder_start_token_id` (as opposed to concatenation).
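For reference, a minimal standalone sketch of the broadcasting behavior being described here (illustrative values only, not the actual `generate` code):

```python
import torch

input_ids = torch.ones((2, 4), dtype=torch.int64)  # stand-in batch of shape (batch, seq)
decoder_start_token_id = torch.tensor(5)

# Adding a 0-dim tensor to a tensor of zeros broadcasts, yielding a
# (batch, seq) tensor filled with the decoder start token id...
decoder_input_ids = (
    torch.zeros(input_ids.shape[:2], dtype=torch.int64, device=input_ids.device)
    + decoder_start_token_id
)
print(decoder_input_ids)  # tensor([[5, 5, 5, 5], [5, 5, 5, 5]])

# ...whereas concatenation would change the sequence length instead:
# torch.cat([decoder_start_token_id.expand(2, 1), input_ids], dim=-1) has shape (2, 5)
```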
Generate: consistently handle special tokens as tensors (#30624)

* tmp commit
* [test_all] mvp
* missing not
* [test_all] final test fixes
* fix musicgen_melody and rag
* [test_all] empty commit
* PR comments
* Update src/transformers/generation/utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
….41.1 Fixes #31. The handling of special tokens in `transformers` was changed in huggingface/transformers#30624 and huggingface/transformers#30746. This updates the XTTS streaming code accordingly.
What does this PR do?
(reopened from #29788; its requirements have since been merged, which made this PR simpler)
To enable `torch.compile` with `generate`, some special token-related operations have to be rewritten into torch operations. That requires special tokens to be tensors instead of integers or lists of integers (see #29374 for a working prototype).

This PR reworks special token usage in `generate` to consistently treat them as tensors, as opposed to e.g. keeping track of `eos_token_id` in both integer and tensor form.

👉 Review suggestion: start by reading `_prepare_special_tokens` and how it fits in `generate`.
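As a rough illustration of the tensor-normalization idea (a minimal sketch only; the helper name and details below are assumptions, not the actual `_prepare_special_tokens` implementation):

```python
import torch

def as_special_token_tensor(token, device="cpu"):
    # Hypothetical helper: normalize an int, list of ints, or tensor special
    # token (e.g. eos_token_id) into a 1D int64 tensor, so that downstream
    # checks are pure torch ops and therefore torch.compile-friendly.
    if token is None:
        return None
    if isinstance(token, torch.Tensor):
        return token.to(device=device, dtype=torch.int64).reshape(-1)
    if isinstance(token, int):
        token = [token]
    return torch.tensor(token, device=device, dtype=torch.int64)

eos = as_special_token_tensor([2, 32000])
last_tokens = torch.tensor([2, 5])      # last generated token per sequence
is_done = torch.isin(last_tokens, eos)  # tensor([ True, False])
```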
Tests run locally:

- `pytest --doctest-modules src/transformers/generation/logits_process.py -vv`
- `pytest --doctest-modules src/transformers/generation/utils.py -vv`
- `RUN_SLOW=1 py.test tests/generation/ -vv`
- `RUN_SLOW=1 py.test tests/test_cache_utils.py -vv`
- `RUN_SLOW=1 py.test tests/models/llama/test_modeling_llama.py -vv`
- `RUN_SLOW=1 py.test tests/models/whisper/test_modeling_whisper.py -vv` -- same failures as in `main`