Sync/v3.5.1 #85

calpt · 2020-11-18T14:14:00Z

No description provided.

* Create README.md model card for cambridgeltl/BioRedditBERT-uncased. * Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

* correct xlm prophetnet auto model and examples * fix line-break docs

… reversed) (#7949) * fix docstring for 'special_tokens_mask' * revert auto formatter changes * revert another auto format * revert another auto format

@pjox

Hat/tip @pjox

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* added qg evaluation notebook * Update notebooks/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* slow tests should be slow * exception note * style * integrate LysandreJik's notes with some expansions * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * another slow test * fix link, and prose * clarify. * note from Sam * typo Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

@Narsil

cc @Narsil @patrickvonplaten

…7860) * basic config test with online model * typo * style * better test

* ADD: add whole word mask proxy for both eng and chinese * MOD: adjust format * MOD: reformat code * MOD: update import * MOD: fix bug * MOD: add import * MOD: fix bug * MOD: decouple code and update readme * MOD: reformat code * Update examples/language-modeling/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/language-modeling/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/language-modeling/run_language_modeling.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/language-modeling/run_language_modeling.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/language-modeling/run_language_modeling.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/language-modeling/run_language_modeling.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * change wwm to whole_word_mask * reformat code * reformat * format * Code quality * ADD: update chinese ref readme * MOD: small changes * MOD: small changes2 * update readme Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

Looking at the current community notebooks, it seems that few are targeted for absolute beginners and even fewer are written with TensorFlow. This notebook describes absolutely everything a beginner would need to know, including how to save/load their model and use it for new predictions (this is often omitted in tutorials) Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* fix config save * add test * add config class variable and another test * line break * fix fsmt and typo * god am I making many errors today :-/ * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

…eds is None. (#7977) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Actually make the "translation", "translation_XX_to_YY" task behave correctly. Background: - Currently "translation_cn_to_ar" does not work. (only 3 pairs are supported) - Some models, contain in their config the correct values for the (src, tgt) pair they can translate. It's usually just one pair, and we can infer it automatically from the `model.config.task_specific_params`. If it's not defined we can still probably load the TranslationPipeline nevertheless. Proposed fix: - A simplified version of what could become more general which is a `parametrized` task. "translation" + (src, tgt) in this instance it what we need in the general case. The way we go about it for now is simply parsing "translation_XX_to_YY". If cases of parametrized task arise we should preferably go in something closer to what `datasets` propose which is having a secondary argument `task_options`? that will be close to what that task requires. - Should be backward compatible in all cases for instance `pipeline(task="translation_en_to_de") should work out of the box. - Should provide a warning when a specific translation pair has been selected on behalf of the user using `model.config.task_specific_params`. * Update src/transformers/pipelines.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Start simplification * More progress * Finished script * Address comments and update tests instructions * Wrong test * Accept files as inputs and fix test * Update src/transformers/trainer_utils.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Fix labels and add combined score * Add special labels * Update TPU command * Revert to old label strategy * Use model labels * Fix for STT-B * Styling * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Code styling * Fix review comments Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

@sgugger

* FillMaskPipeline: support passing top_k on __call__ Also move from topk to top_k * migrate to new param name in tests * Review from @sgugger

* Only log total_flos at the end of training * Fix test

* add zero shot pipeline tags * rm default and fix yaml format * rm DS_Store * add bart large default * don't add more typos Co-authored-by: Julien Chaumond <chaumond@gmail.com> * add multiple multilingual examples * improve multilingual examples for single-label Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Fix checkpoint loading in Trainer * Fix typo

* Move NoLayerEmbedTokens * TFWrappedEmbeddings * Add comment

* support lowercase tokenizer * fix arg pos

* Add new token classification example * Remove txt file * Add test * With actual testing done * Less warmup is better * Update examples/token-classification/run_ner_new.py Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Fix test * Make Lysandre happy * Last touches and rename * Rename in tests * Address review comments * More run_ner -> run_ner_old Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* fairseq broke chkpt data - fixing that * style * support older bpecodes filenames - specifically "code" in iwslt14

* add training tests * correct longformer * fix docs * fix some tests * fix some more train tests * remove ipdb * fix multiple edge case model training * fix funnel and prophetnet * clean gpt models * undo renaming of albert

* gpu decorators table * whitespace * Update docs/source/testing.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * whitespace Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* add a multi-gpu job for all example tests * run only ported tests * rename * explain why env is re-activated on each step * mark all unported/checked tests with @require_torch_non_multigpu_but_fix_me * style * Apply suggestions from code review Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

…llowing paper (#8417) * Move XLNet memory length FutureWarning * isort * style * Changed default XLNet memory length

@sshleifer

* fix typo * rm use_cdn & references, and implement new hf_bucket_url * I'm pretty sure we don't need to `read` this file * same here * [BIG] file_utils.networking: do not gobble up errors anymore * Fix CI 😇 * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Tiny doc tweak * Add doc + pass kwarg everywhere * Add more tests and explain cc @sshleifer let me know if better Co-Authored-By: Sam Shleifer <sshleifer@gmail.com> * Also implement revision in pipelines In the case where we're passing a task name or a string model identifier * Fix CI 😇 * Fix CI * [hf_api] new methods + command line implem * make style * Final endpoints post-migration * Fix post-migration * Py3.6 compat cc @stefan-it Thank you @stas00 Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Patch token classification pipeline * Some added tests for TokenClassificationArgumentHandler (#8366) Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* Update RST * Finer details * Re-organize * Style

MichalPleban and others added 30 commits October 21, 2020 08:30

Create README.md (#7819)

35d2ad5

Model card for German BERT fine-tuned for LER/NER (#7855)

2b07ec7

Create README.md (#7857)

58fb25f

* Create README.md model card for cambridgeltl/BioRedditBERT-uncased. * Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Add AI-SOCO models (#7867)

bf162ce

TensorBoard/Wandb/optuna/raytune integration improvements. (#7935)

e174bfe

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

[ProphetNet] Correct Doc string example (#7944)

9b6610f

* correct xlm prophetnet auto model and examples * fix line-break docs

fix test (#7947)

52decab

fix 'encode_plus' docstring for 'special_tokens_mask' (0s and 1s were…

16da877

… reversed) (#7949) * fix docstring for 'special_tokens_mask' * revert auto formatter changes * revert another auto format * revert another auto format

[model_cards] camembert: dataset = oscar

f8d3695

Hat/tip @pjox

[seq2seq testing] multigpu test run via subprocess (#7281)

8b38173

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

added qg evaluation notebook (#7958)

4abb7ff

* added qg evaluation notebook * Update notebooks/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Herbert tokenizer auto load (#7968)

95792a9

support relative path for best_model_checkpoint (#7973)

f774b2e

[model_card] t5-11b move disclaimer to top of page

a7db81c

cc @Narsil @patrickvonplaten

Disable inference API for t5-11b (#7978)

3479787

[fsmt test] basic config test with online model + super tiny model (#…

64b4d25

…7860) * basic config test with online model * typo * style * better test

Remove the else branch adding 0 to the hidden state if token_type_emb…

901e9b8

…eds is None. (#7977) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

FillMaskPipeline: support passing top_k on __call__ (#7971)

ff65bea

* FillMaskPipeline: support passing top_k on __call__ Also move from topk to top_k * migrate to new param name in tests * Review from @sgugger

Only log total_flos at the end of training (#7981)

06fc395

* Only log total_flos at the end of training * Fix test

Fix documentation redirect

467573d

Reload checkpoint (#7984)

5ae935d

* Fix checkpoint loading in Trainer * Fix typo

[gh ci] less output ( --durations=50) (#7989)

5ac0751

Move NoLayerEmbedTokens (#7945)

0397619

* Move NoLayerEmbedTokens * TFWrappedEmbeddings * Add comment

stas00 and others added 24 commits November 9, 2020 10:41

[fsmt tokenizer] support lowercase tokenizer (#8389)

78d706f

* support lowercase tokenizer * fix arg pos

Bump tokenizers (#8419)

c7cb1aa

Fix typo

5c766ec

[fsmt convert script] fairseq broke chkpt data - fixing that (#8377)

d4d1fbf

* fairseq broke chkpt data - fixing that * style * support older bpecodes filenames - specifically "code" in iwslt14

Deprecate old data/metrics functions (#8420)

5204051

[docs] remove sshleifer from issue-template :( (#8418)

46509d1

Fix bart shape comment (#8423)

a8339b9

[docs] [testing] gpu decorators table (#8422)

ef032dd

* gpu decorators table * whitespace * Update docs/source/testing.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * whitespace Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

Check all models are in an auto class (#8425)

a39218b

Changing XLNet default from not using memories to 512 context size fo…

4185b11

…llowing paper (#8417) * Move XLNet memory length FutureWarning * isort * style * Changed default XLNet memory length

Patch token classification pipeline (#8364)

850afb4

* Patch token classification pipeline * Some added tests for TokenClassificationArgumentHandler (#8366) Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

Update links from s3 to huggingface.co

55e8d0c

Fix style

ad2303a

Model sharing rst (#8439)

9cebee3

* Update RST * Finer details * Re-organize * Style

Release: v3.5.0

818878d

Fix typo

1fb3617

Release: v3.5.1

d5b3e56

Merge tag 'v3.5.1' into sync/v3.5.1

41ac520

Doc style & check fixes

3981c2b

Merge branch 'master' into sync/v3.5.1

7a7230a

calpt added the sync label Nov 18, 2020

Bug fixes for adapter-supporting models

659285c

calpt mentioned this pull request Nov 18, 2020

Sync/v3.4.0 #77

Closed

calpt marked this pull request as ready for review November 18, 2020 18:45

calpt merged commit 0270fdd into adapter-hub:master Nov 19, 2020

calpt deleted the sync/v3.5.1 branch November 19, 2020 09:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync/v3.5.1 #85

Sync/v3.5.1 #85

calpt commented Nov 18, 2020

Sync/v3.5.1 #85

Sync/v3.5.1 #85

Conversation

calpt commented Nov 18, 2020