Correct the new defaults #34377

Cyrilvallez · 2024-10-24T13:22:49Z

What does this PR do?

This corrects the new defaults set in #34026. The previous PR had the effect that if max_length is explicitly set by the user in generate, it is automatically overriden to a new value which is extremely confusing and not wanted.
This adds a check to avoid this scenario.

Cyrilvallez · 2024-10-24T13:43:07Z

cc @ArthurZucker

HuggingFaceDocBuilderDev · 2024-10-24T14:31:39Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

ah nice catch!

…ngth

gante

Makes sense 👍

* Correct the new defaults * CIs * add check * Update utils.py * Update utils.py * Add the max_length in generate test checking shape without passing length * style * CIs * fix fx CI issue

* Support BatchNorm in Hubert pos_conv_emb as in fairseq * Correct the new defaults (#34377) * Correct the new defaults * CIs * add check * Update utils.py * Update utils.py * Add the max_length in generate test checking shape without passing length * style * CIs * fix fx CI issue * [auto. ping] Avoid sending empty info + add more team members (#34383) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Fix glm (#34388) * Fix duplicated * fix import * Use non nested images and batched text Idefics2/3 (#34222) * add support for non nested images and add tests * add tests error scenario * fix style * added single and no image to error tests * Fix onnx non-expotable inplace aten op (#34376) * fix onnx non-expotable inplace op * mistral, qwen2, qwen2_vl, starcoder2 * fixup copies * Fix right padding in LLaVA models (#34305) * fix right pad llavas * device mismatch * no filter (#34391) * no filter * no filter * no filter --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * SynthID: better example (#34372) * better example * Update src/transformers/generation/configuration_utils.py * Update src/transformers/generation/logits_process.py * nits * Tests: upgrade `test_eager_matches_sdpa_generate` (#34386) * Fix bnb training test failure (#34414) * Fix bnb training test: compatibility with OPTSdpaAttention * Avoid check expected exception when it is on CUDA (#34408) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Fix typos in agents_advanced.md (#34405) * [docs] Cache implementations (#34325) cache * [run-slow] hubert * Support BatchNorm in Hubert pos_conv_emb as in fairseq Add conversion integration test, and make batchnorm explicit variable * Support BatchNorm in Hubert pos_conv_emb as in fairseq fix make fixup styling changes * [run-slow] hubert * Support BatchNorm in Hubert pos_conv_emb as in fairseq * [run-slow] hubert * Support BatchNorm in Hubert pos_conv_emb as in fairseq Add conversion integration test, and make batchnorm explicit variable * Support BatchNorm in Hubert pos_conv_emb as in fairseq fix make fixup styling changes * [run-slow] hubert * [run-slow] hubert --------- Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com> Co-authored-by: Raushan Turganbay <raushan@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> Co-authored-by: Rudy Delouya <rudy.delouya@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>

Cyrilvallez marked this pull request as ready for review October 24, 2024 13:42

Cyrilvallez added 4 commits October 24, 2024 15:47

Correct the new defaults

59e15f3

CIs

83df64b

add check

88d0404

Update utils.py

408a2f9

Cyrilvallez force-pushed the new-defaults branch from bb674d8 to 408a2f9 Compare October 24, 2024 13:47

Update utils.py

74bec90

ArthurZucker approved these changes Oct 24, 2024

View reviewed changes

Cyrilvallez added 2 commits October 24, 2024 17:40

Add the max_length in generate test checking shape without passing le…

89d4e55

…ngth

style

65f9495

gante approved these changes Oct 24, 2024

View reviewed changes

Cyrilvallez added 2 commits October 24, 2024 18:01

CIs

a04e288

fix fx CI issue

574f2f6

ArthurZucker merged commit 4c6e0c9 into main Oct 24, 2024
22 of 26 checks passed

ArthurZucker deleted the new-defaults branch October 24, 2024 16:42

techkang mentioned this pull request Oct 25, 2024

enable average tokens across devices #34373

Merged

5 tasks

ydshieh mentioned this pull request Nov 19, 2024

Fix Whisper CI #34617

Merged

zucchini-nlp mentioned this pull request Dec 2, 2024

fix test_generated_length_assisted_generation #34935

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correct the new defaults #34377

Correct the new defaults #34377

Cyrilvallez commented Oct 24, 2024 •

edited

Loading

Cyrilvallez commented Oct 24, 2024

HuggingFaceDocBuilderDev commented Oct 24, 2024

ArthurZucker left a comment

gante left a comment

Correct the new defaults #34377

Correct the new defaults #34377

Conversation

Cyrilvallez commented Oct 24, 2024 • edited Loading

What does this PR do?

Cyrilvallez commented Oct 24, 2024

HuggingFaceDocBuilderDev commented Oct 24, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

gante left a comment

Choose a reason for hiding this comment

Cyrilvallez commented Oct 24, 2024 •

edited

Loading