
Add musicgen melody #3

Merged
merged 127 commits into add-training-musicgen on Mar 22, 2024

Conversation

ylacombe
Owner

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

DonggeunYu and others added 30 commits March 4, 2024 14:18
* Update ms_deform_attn_cuda.cu

* Update ms_deform_attn_cuda.cuh

* Update modeling_deformable_detr.py

* Update src/transformers/models/deformable_detr/modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update modeling_deformable_detr.py

* python utils/check_copies.py --fix_and_overwrite

* Fix dtype mismatch error

* Update test_modeling_deformable_detr.py

* Update test_modeling_deformable_detr.py

* Update modeling_deformable_detr.py

* Update modeling_deformable_detr.py

* Support DeformableDETR with bfloat16

* Add test code

* Use AT_DISPATCH_FLOATING_TYPES_AND2


* Update tests/models/deformable_detr/test_modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/deformable_detr/test_modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fix missing require_torch_bf16 function

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
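
The commits above add bfloat16 support to DeformableDETR's custom multi-scale deformable attention CUDA kernel (via AT_DISPATCH_FLOATING_TYPES_AND2). A minimal sketch of exercising the model in bf16; the checkpoint id is the usual public one, and the surrounding plumbing is an assumption, not part of this PR:

```python
import torch
import requests
from PIL import Image
from transformers import AutoImageProcessor, DeformableDetrForObjectDetection

# COCO sample image commonly used in the transformers docs.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

processor = AutoImageProcessor.from_pretrained("SenseTime/deformable-detr")
# torch_dtype=torch.bfloat16 casts the weights; the patched CUDA kernel is
# what lets the deformable attention op itself run in bf16.
model = DeformableDetrForObjectDetection.from_pretrained(
    "SenseTime/deformable-detr", torch_dtype=torch.bfloat16
).to("cuda")

inputs = processor(images=image, return_tensors="pt")
pixel_values = inputs["pixel_values"].to("cuda", torch.bfloat16)
with torch.no_grad():
    outputs = model(pixel_values=pixel_values)
print(outputs.logits.dtype)  # torch.bfloat16
```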
* First draft

* More improvements

* More improvements

* More fixes

* Fix copies

* More improvements

* More fixes

* More improvements

* Convert checkpoint

* More improvements, set up tests

* Fix more tests

* Add UdopModel

* More improvements

* Fix equivalence test

* More fixes

* Redesign model

* Extend conversion script

* Use real inputs for conversion script

* Add image processor

* Improve conversion script

* Add UdopTokenizer

* Add fast tokenizer

* Add converter

* Update READMEs

* Add processor

* Add fully fledged tokenizer

* Add fast tokenizer

* Use processor in conversion script

* Add tokenizer tests

* Fix one more test

* Fix more tests

* Fix tokenizer tests

* Enable fast tokenizer tests

* Fix more tests

* Fix additional_special_tokens of fast tokenizer

* Fix tokenizer tests

* Fix more tests

* Fix equivalence test

* Rename image to pixel_values

* Rename seg_data to bbox

* More renamings

* Remove vis_special_token

* More improvements

* Add docs

* Fix copied from

* Update slow tokenizer

* Update fast tokenizer design

* Make text input optional

* Add first draft of processor tests

* Fix more processor tests

* Fix decoder_start_token_id

* Fix test_initialization

* Add integration test

* More improvements

* Improve processor, add test

* Add more copied from

* Add more copied from

* Add more copied from

* Add more copied from

* Remove print statement

* Update README and auto mapping

* Delete files

* Delete another file

* Remove code

* Fix test

* Fix docs

* Remove asserts

* Add doc tests

* Include UDOP in exotic model tests

* Add expected Tesseract decodings

* Add sentencepiece

* Use same design as T5

* Add UdopEncoderModel

* Add UdopEncoderModel to tests

* More fixes

* Fix fast tokenizer

* Fix one more test

* Remove parallelisable attribute

* Fix copies

* Remove legacy file

* Copy from T5Tokenizer

* Fix rebase

* More fixes, copy from T5

* More fixes

* Fix init

* Use ArthurZ/udop for tests

* Make all model tests pass

* Remove UdopForConditionalGeneration from auto mapping

* Fix more tests

* fixups

* more fixups

* fix the tokenizers

* remove unnecessary changes

* nits

* nits

* replace truncate_sequences_boxes with truncate_sequences for fix-copies

* nit current path

* add a test for input ids

* use the expected ids taken from c9f7a32

* nits converting

* nits

* apply ruff

* nits

* nits

* style

* fix slow order of addition

* fix udop fast range as well

* fixup

* nits

* Add docstrings

* Fix gradient checkpointing

* Update code examples

* Skip tests

* Update integration test

* Address comment

* Make fixup

* Remove extra ids from tokenizer

* Skip test

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update year

* Address comment

* Address more comments

* Address comments

* Add copied from

* Update CI

* Rename script

* Update model id

* Add AddedToken, skip tests

* Update CI

* Fix doc tests

* Do not use Tesseract for the doc tests

* Remove kwargs

* Add original inputs

* Update casting

* Fix doc test

* Update question

* Update question

* Use LayoutLMv3ImageProcessor

* Update organization

* Improve docs

* Update forward signature

* Make images optional

* Remove deprecated device argument

* Add comment, add add_prefix_space

* More improvements

* Remove kwargs

---------

Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
…29310)

* torchscript and trainer md es translation

* corrected md es files and even corrected spelling in en md

* made es corrections to trainer.md

* deleted entrenamiento... title on yml

* placed entrenamiento in right place
* added exllama kernels support for awq models

* doc

* style

* Update src/transformers/modeling_utils.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* refactor

* moved exllama post init to after device dispatching

* bump autoawq version

* added exllama test

* style

* configurable exllama kernels

* copy exllama_config from gptq

* moved exllama version check to post init

* moved to quantization dockerfile

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
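
The commits above wire ExLlama kernels into the AWQ integration, with an exllama config copied from the GPTQ path and post-init moved to after device dispatching. A hedged usage sketch; the exact config fields and the checkpoint id are assumptions:

```python
from transformers import AutoModelForCausalLM, AwqConfig

# Opt into the ExLlama kernels for an already-AWQ-quantized checkpoint.
quantization_config = AwqConfig(version="exllama")

model = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Mistral-7B-Instruct-v0.1-AWQ",  # hypothetical example checkpoint
    quantization_config=quantization_config,
    device_map="auto",  # exllama post-init runs after device dispatching
)
```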
* fix max_length for blip

* also update min length

* fixes

* add a comment

* Update src/transformers/models/instructblip/modeling_instructblip.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/blip_2/modeling_blip_2.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* make fixup

* fix length when user passed

* remove else

* remove brackets

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
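
The commits above adjust how BLIP-2 and InstructBLIP handle user-passed generation lengths. A minimal sketch of exercising that path; the checkpoint id is the public BLIP-2 OPT-2.7B model, and the comment on length handling is an inference from the commit titles:

```python
import torch
import requests
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16, device_map="auto"
)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, return_tensors="pt").to(model.device, torch.float16)

# A user-passed max_length should now account for the query tokens the model
# prepends to the language-model input (per the commit titles above).
out = model.generate(**inputs, max_length=30)
print(processor.batch_decode(out, skip_special_tokens=True))
```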
* use torch_device

* Update tests/pipelines/test_pipelines_text_generation.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix style

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* update

* ...

* nits

* arf

* 🧼

* beat the last guy

* style everyone
* style

* revert with RP

* nit

* exact revert
* initial-commit

* start cleaning

* small nits

* small nits

* current updates

* add kernels

* small refactoring little step

* add comments

* styling

* nit

* nits

* Style

* Small changes

* Push dummy mamba simple slow

* nit

* Use original names

* Use original names and remove norm

* Updates for inference params

* Style and updates

* nits

* Match logits

* Add a test

* Add expected generated text

* nits doc, imports and styling

* style

* oups

* don't install kernels; invite users to install the required kernels

* let users use the original packages

* styling

* nits

* fix some copies

* update doc

* fix-copies

* styling done

* nits

* fix import check

* runs, but wrong CUDA results

* mamba CUDA works :)

* fix the fast path

* config naming nits

* conversion script is not required at this stage

* finish fixing the fast path: generation make sense now!

* nit

* Let's start working on the CIs

* style

* better style

* more nits

* test nit

* quick fix for now

* nits

* nit

* nit

* nit

* nits

* update test rest

* fixup

* update test

* nit

* some fixes

* nits

* update test values

* fix styling

* nit

* support peft

* integration tests require torch

* also add slow markers

* styling

* choose forward wisely

* nits

* update tests

* fix gradient checkpointing

* fixup

* nit

* fix doc

* check copies

* fix the docstring

* fix some more tests

* style

* fix beam search

* add init scheme

* update

* nit

* fix

* fixup the doc

* fix the doc

* fixup

* tentative update but slow is no longer good

* nit

* should we always use float32?

* nits

* revert wrong changes

* res in float32

* cleanup

* skip fmt for now

* update generation values

* update test values running original model

* fixup

* update tests + rename inference_params to cache_params + make sure training does not use cache_params

* small nits

* more nits

* fix final CIs

* style

* nit doc

* I hope final doc nits

* nit

* 🫠

* final touch!

* fix torch import

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <hi@lysand.re>

* Apply suggestions from code review

* fix fix and fix

* fix base model prefix!

* nit

* Update src/transformers/models/mamba/__init__.py

* Update docs/source/en/model_doc/mamba.md

Co-authored-by: Lysandre Debut <hi@lysand.re>

* nit

---------

Co-authored-by: Lysandre Debut <hi@lysand.re>
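
For reference, a short usage sketch of the Mamba integration assembled in the commits above; the checkpoint id is an assumption (the official conversions live under the state-spaces organization on the Hub):

```python
from transformers import AutoTokenizer, MambaForCausalLM

tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-130m-hf")
model = MambaForCausalLM.from_pretrained("state-spaces/mamba-130m-hf")

inputs = tokenizer("The Mamba architecture is", return_tensors="pt")
# Generation uses cache_params (the renamed inference_params) internally;
# the fused CUDA kernels are optional and a slow path is used if absent.
out = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(out[0]))
```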
…29041)

* Fix bug with passing capture_* args to neptune callback

* ruff happy?

* instantiate (frozen)set only once

* code review

* code review 2

* ruff happy?

* code review
* Update to pull function from proper lib

* Fix ruff formatting error

* Remove accidently added file
…e#29390)

* Automatic safetensors conversion when lacking these files

* Remove debug

* Thread name

* Typo

* Ensure that raised exceptions do not affect the main thread
* [i18n-zh] Translate add_new_pipeline.md into Chinese

* apply suggestions from Fan-Lin
…e#29086)

* Update ko _toctree.yml

* Create ko: generation_strategies.md

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
… were given (huggingface#29457)

* use require_torch_gpu

* enable on XPU

* fix
* add docs on exllamav2 + AWQ

* Update docs/source/en/quantization.md
* add accelerate docs

* Apply suggestions from code review

Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* Update starcoder2.md

* add correct generation

---------

Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>
…fetch_factor (huggingface#29447)

* Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor

dataloader_prefetch_factor was added to TrainingArguments in huggingface#28498 with a default value of None, but versions of torch < 2.0.0 do not accept None and raise an error if num_workers == 0 and prefetch_factor != 2 (see the sketch below)

* Add is_torch_available() check

* Use is_torch_greater_or_equal_than_2_0

add back check for dataloader_prefetch_factor
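
A minimal sketch of the compatibility issue this fixes, with names simplified for illustration: torch < 2.0.0's DataLoader rejects prefetch_factor=None, so the value must only be forwarded when it is actually set:

```python
from torch.utils.data import DataLoader

def build_loader(dataset, num_workers: int, prefetch_factor=None):
    # Simplified stand-in for the Trainer's dataloader construction.
    kwargs = dict(num_workers=num_workers)
    if prefetch_factor is not None:
        # Older torch does not accept None at all, and raises if
        # num_workers == 0 and prefetch_factor != 2.
        kwargs["prefetch_factor"] = prefetch_factor
    return DataLoader(dataset, **kwargs)
```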
)

* added the max_matching_ngram_size parameter to GenerationConfig for the PromptLookupCandidateGenerator

* switched back to keyword arguments

* added PromptLookupCandidateGenerator docstring for its parameters

* ruff reformat

* Update src/transformers/generation/configuration_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
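
A hedged sketch of using the new parameter: prompt_lookup_num_tokens is the existing switch for prompt-lookup decoding, and max_matching_ngram_size is the knob added by the commits above (the model id is just a small public example):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The quick brown fox jumps over the", return_tensors="pt")
# max_matching_ngram_size bounds the n-gram used to match candidate
# continuations against the prompt during prompt-lookup decoding.
out = model.generate(
    **inputs,
    prompt_lookup_num_tokens=10,
    max_matching_ngram_size=2,
    max_new_tokens=20,
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```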
robinverduijn and others added 28 commits March 14, 2024 10:54
Add newly added models to all README files.

Also fix one relative path in README_ru.md.
… saved on TPU (huggingface#29388)

* Fix for saving adapter weights when using PEFT

* Change supported-classes to PushToHubMixin
* add arg

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
…a dict of templates (huggingface#29658)

* Allow apply_chat_template to pass kwargs to the template

* Fix priority for template_kwargs

* Fix docstring

* style fix

* Add the option for the model to have a dict of templates

* Error message cleanup

* Add test for chat template dicts

* Simplify the chat template dict test and apply it to all tokenizers in self.get_tokenizers()

* Save chat template dicts as lists with fixed key names

* Add test for serialization/reloading

* Add require_jinja just to be safe, even though I don't think we use it
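
A short sketch of the behaviour the commits above introduce; the tokenizer id is just a public chat model used for illustration:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")

chat = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]

# Render without tokenizing to inspect the prompt. Per the commits above,
# extra keyword arguments are now forwarded to the Jinja template, and a
# tokenizer may carry a dict of named templates selectable via the
# chat_template argument (e.g. chat_template="tool_use" on models that
# define one; that name is hypothetical here).
text = tokenizer.apply_chat_template(chat, add_generation_prompt=True, tokenize=False)
print(text)
```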
…#29661)

* docs:inaccurate_code_example

* Inaccurate code example within inline code-documentation
…9000)

* Extend import utils to cover "editable" torch versions

* Re-add type hint

* Remove whitespaces

* Double quote strings

* Update comment

Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* Restore package_exists

* Revert "Restore package_exists"

This reverts commit 66fd2cd.

---------

Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
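
A simplified sketch of the version-probing pattern these commits extend: consult importlib.metadata first, then fall back to the module's __version__ for "editable" installs whose distribution metadata may be missing. The helper name is hypothetical; the real import utils in transformers handle more corner cases:

```python
import importlib.metadata
import importlib.util
from typing import Optional

def package_version(pkg_name: str) -> Optional[str]:
    if importlib.util.find_spec(pkg_name) is None:
        return None  # not importable at all
    try:
        return importlib.metadata.version(pkg_name)
    except importlib.metadata.PackageNotFoundError:
        # Editable/dev installs: import the module and read __version__.
        module = __import__(pkg_name)
        return getattr(module, "__version__", None)
```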
* Cohere Model Release (#1)

Cohere Model Release

* Remove unnecessary files and code (#2)

Some cleanup

* Delete cohere-model directory (#3)

* Make Fix (huggingface#5)

* Pr fixes (huggingface#6)

* fixes for pr

* pr fixes for the format

* pr fixes for the format

* src/transformers/models/auto/tokenization_auto.py

* Tokenizer test (huggingface#8)

* tokenizer test

* format fix

* Adding Docs and other minor changes (huggingface#7)

* Add modeling tests (huggingface#9)

* Smol Fix (huggingface#11)

* tokenization tests are fixed

* format fixes

* fix pr doc tests

* fix pr doc tests

* fix pr doc tests

* fix pr style check

* small changes in cohere.md

* FIX: Address final comments for transformers integration (huggingface#13)

* fix modeling final nits and add proper test file

* for now leave empty tests

* add integration test

* push new test

* fix modeling cohere (huggingface#14)

* Update chat templates to use the new API (huggingface#15)

---------

Co-authored-by: ahmetustun <ahmetustun89@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
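
A short usage sketch for the Cohere release merged in the commits above; the checkpoint id is the publicly announced Command-R model (gated on the Hub) and is assumed here:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "CohereForAI/c4ai-command-r-v01"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Cohere ships chat templates via the new template API (commit huggingface#15 above).
messages = [{"role": "user", "content": "Hello, how are you?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
out = model.generate(input_ids, max_new_tokens=40)
print(tokenizer.decode(out[0]))
```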
* fix

* fix style

* remove equivalent tests

* add back for image_processor

* remove again
Removed static_real_features from AutoformerForPrediction example code

Signed-off-by: Maciej Torhan <maciek97x@gmail.com>
…nvironment before testing (huggingface#29477)

* fix

* fix style

* add warning

* revert

* no newline

* revert

* revert

* add CUDA as well
update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Signed-off-by: guoguangwu <guoguangwug@gmail.com>
* Update run_glue.py

* Update run_glue.py

* Update run_glue_no_trainer.py
* start integration

* fix

* add and debug tests

* update tests

* make pytorch serialization work

* compatible with device_map and offload

* fix tests

* make style

* add ref

* guard against safetensors

* add float8 and style

* fix is_serializable

* Fix shard_checkpoint compatibility with quanto

* more tests

* docs

* adjust memory

* better

* style

* pass tests

* Update src/transformers/modeling_utils.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* add is_safe_serialization instead

* Update src/transformers/quantizers/quantizer_quanto.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* add QbitsTensor tests

* fix tests

* simplify activation list

* Update docs/source/en/quantization.md

Co-authored-by: David Corvoysier <david.corvoysier@gmail.com>

* better comment

* Update tests/quantization/quanto_integration/test_quanto.py

Co-authored-by: David Corvoysier <david.corvoysier@gmail.com>

* Update tests/quantization/quanto_integration/test_quanto.py

Co-authored-by: David Corvoysier <david.corvoysier@gmail.com>

* find and fix edge case

* Update docs/source/en/quantization.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* pass weights_only_kwarg instead

* fix shard_checkpoint loading

* simplify update_missing_keys

* Update tests/quantization/quanto_integration/test_quanto.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* recursion to get all tensors

* block serialization

* skip serialization tests

* fix

* change by cuda:0 for now

* fix regression

* update device_map

* fix doc

* add notebook

* update torch_dtype

* update doc

* typo

* typo

* remove comm

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: David Corvoysier <david.corvoysier@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Younes Belkada <younesbelkada@gmail.com>
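
A hedged sketch of the quanto path integrated above: weights are quantized at load time, serialization is blocked for now, and the device map is pinned to cuda:0 per the commits. The config values are assumptions:

```python
from transformers import AutoModelForCausalLM, QuantoConfig

# int8 weight-only quantization via quanto; activations stay in full precision.
quantization_config = QuantoConfig(weights="int8")

model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",
    quantization_config=quantization_config,
    device_map="cuda:0",  # "change by cuda:0 for now" per the commits
)
```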
* replace breaks by a loop condition

* Update src/transformers/generation/utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* fix speech_to_text generation tests

* Add details to comment

* Update tests/models/speech_to_text/test_modeling_speech_to_text.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Revert "Fix wrong condition used in `filter_models` (huggingface#29673)"

This reverts commit 174aecd.
* add attention to es/ and edit es/_toctree.yml

* translate attention.md

* fix transformers

* fix transformers
ylacombe merged commit 5ba6f3b into add-training-musicgen on Mar 22, 2024
1 check passed
ylacombe pushed a commit that referenced this pull request May 20, 2024
* inital commit

* update

* update conversion checkpoint

* update conversion script

* nits

* some fixes

* nits

* merge

* fix permute

* nits

* fix

* nits

* nits

* nits

* fix rope

* fix both rope

* nits

* style

* make sure flax works

* fix flax init code

* fix forward

* nits

* print flax generation out

* current code

* nits

* SIIIIIIIIIIIIIIIIIII

* update

* add new tokenizer

* correct fast tokenizer

* fix conversion

* more comments

* fix modeling and conversion

* nits and nits

* nits testing

* add some tokenization tests

* add some edge cases

* add slow tests and fix them

* fixup

* fix copies for modeling

* fix copies

* add 7B slow tests

* fix

* fix

* fix tests

* make tokenizer CIs go green

* styling

* last tokenizer nits

* update jax tests

* fix flax for 7b

* add jit testing 🤗

* cleanups

* isolated nit, inv_freq for rotary_emb.inv_freq

* propagate to jax

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* adjust test

* fix conversion script

* change name

* correct file names

* update conversion script

* Fix bos and eos token ids in the model configuration (#3)

* update modelling

* update conversion script

* add static cache for gemma

* fix sdpa generate

* fix batched

* multiple fixes

* fix FA2

* final fix

* Rename a few missing strings and filenames (#4)

* merge with upstream main

* fix copies

* fix copies

* fix fixup

* fix fixup

* fix

* fix

* final tests

* fix fx gemma tests

* fix fx bf16/fp16 tests

* update slow fx tests

* fx slow tests: one logits, one generation

* move jit test standalone

* Apply suggestions from code review

* nits

* tokenizer updates

* more tokenization updates: custom GemmaSentencepieceExtrator

* style

* Update src/transformers/cache_utils.py

* Update src/transformers/models/gemma/__init__.py

* Update tests/models/gemma/test_modeling_flax_gemma.py

* small nits

* style

* update tokenization test

* fix the rotary embedding

* with style

* fix slow tests

* WARNING: this commit might be very important for precision

* Update tests/models/gemma/test_modeling_flax_gemma.py

* Update src/transformers/models/gemma/configuration_gemma.py

Co-authored-by: Lysandre Debut <hi@lysand.re>

* Update src/transformers/models/gemma/modeling_flax_gemma.py

Co-authored-by: Lysandre Debut <hi@lysand.re>

* small nits here and there!

* forgotten nit

* remove on the fly computation of inv_freq

* revert previous change, let's be safe and for now re-compute freq cis to make sure it's in float

* Apply suggestions from code review

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_flax_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_tokenization_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_tokenization_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_tokenization_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_tokenization_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/models/gemma/test_modeling_gemma.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* nit conversion script link

* fix some tests

* add not doctest and pr doctest

* repo consistency

* fix last CIs 🚀

* update all readmes

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Lysandre Debut <hi@lysand.re>
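
A brief usage sketch for the Gemma port this referenced commit adds; the checkpoint id is the released (gated) 2B model and is assumed here:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")
model = AutoModelForCausalLM.from_pretrained("google/gemma-2b", device_map="auto")

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(out[0]))
```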