forked from huggingface/transformers
Try 1.6 #1
Open
zcain117 wants to merge 166 commits into master from try-1.6
Conversation
…uggingface#6411)
* add encoder-decoder for roberta
* fix headmask
* apply Sylvain's suggestions
* fix typo
* Apply suggestions from code review
* add targets arg to fill-mask pipeline
* add tests and more error handling
* quality
* update docstring
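A minimal usage sketch of the new `targets` argument; the checkpoint and candidate words are illustrative, not taken from the commit:

```python
from transformers import pipeline

# Model name is illustrative; any masked-LM checkpoint works.
fill_mask = pipeline("fill-mask", model="distilroberta-base")

# With `targets`, predictions are scored only against the supplied
# candidates instead of the full vocabulary. For RoBERTa's BPE vocab,
# standalone words need the leading space to match a single token.
results = fill_mask("The capital of France is <mask>.", targets=[" Paris", " London"])
for r in results:
    print(r["token_str"], round(r["score"], 4))
```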
* cleanup torch unittests: part 2
* remove trailing comma added by isort, which breaks flake
* one more comma
* revert odd balls
* part 3: odd cases
* more ["key"] -> .key refactoring
* .numpy() is not needed
* more unnecessary .numpy() removed
* more simplification
* Update README.md * Update README.md * Update README.md
* fix * fix2 * fix3
* Test model outputs equivalence
* Fix failing tests
* From dict to kwargs
* DistilBERT
* Addressing @sgugger and @patrickvonplaten's comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
…ggingface#6457)
* Add more token classification examples
* POS tagging example
* Phrase chunking example
* PR review fixes
* Add conllu to third party list (used in token classification examples)
* Clean Dir after testing * remove pabee ignore
* add MBartForConditionalGeneration
* style
* rebase and fixes
* add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS
* fix docs
* don't ignore mbart
* doc
* fix mbart fairseq link
* put mbart before bart
* apply doc suggestions
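As a rough usage sketch of the new class (the checkpoint name is an assumption, not from the commit), it plugs into the usual `generate` API for translation:

```python
from transformers import MBartForConditionalGeneration, MBartTokenizer

# facebook/mbart-large-en-ro is an assumed example checkpoint (EN -> RO);
# its config is expected to supply the target-language start token.
tokenizer = MBartTokenizer.from_pretrained("facebook/mbart-large-en-ro")
model = MBartForConditionalGeneration.from_pretrained("facebook/mbart-large-en-ro")

batch = tokenizer(
    ["UN Chief says there is no plan to stop chemical weapons in Syria"],
    return_tensors="pt",
)
translated = model.generate(**batch)
print(tokenizer.batch_decode(translated, skip_special_tokens=True))
```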
* Use hash to clean the test dirs
* Use hash to clean the test dirs
* Use hash to clean the test dirs
* fix
* add cross attention layers for gpt2
* make gpt2 cross attention work
* finish bert2gpt2
* add explicit comments
* remove attention mask since not yet supported
* revert attn mask in pipeline
* Update src/transformers/modeling_gpt2.py
* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
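A hedged sketch of the bert2gpt2 setup this enables, using `EncoderDecoderModel` with illustrative checkpoints:

```python
from transformers import EncoderDecoderModel, BertTokenizer, GPT2Tokenizer

# Tie a BERT encoder to a GPT-2 decoder; the new cross-attention layers
# let GPT-2 attend to the encoder's hidden states.
model = EncoderDecoderModel.from_encoder_decoder_pretrained("bert-base-uncased", "gpt2")

enc_tok = BertTokenizer.from_pretrained("bert-base-uncased")
dec_tok = GPT2Tokenizer.from_pretrained("gpt2")

input_ids = enc_tok("A long article to condense.", return_tensors="pt").input_ids
labels = dec_tok("A short summary.", return_tensors="pt").input_ids

# Seq2seq LM loss; note the commit's caveat that cross-attention masking
# was not yet supported at this point.
loss = model(input_ids=input_ids, decoder_input_ids=labels, labels=labels).loss
```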
* change unique_no_split_tokens's type to set * use sorted list instead of set * style
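The reasoning behind the sort, sketched: Python set iteration order depends on string hashing, which is randomized per process, so serializing a set of added tokens was not reproducible.

```python
# Hash randomization (PYTHONHASHSEED) makes set order vary across runs:
tokens = {"<mask>", "<pad>", "<s>"}
print(list(tokens))  # order may differ between processes

# A sorted list gives the same order every time:
unique_no_split_tokens = sorted(tokens)
print(unique_no_split_tokens)  # ['<mask>', '<pad>', '<s>'] always
```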
* Generation doc
* MBartForConditionalGeneration (huggingface#6441)
  * add MBartForConditionalGeneration
  * style
  * rebase and fixes
  * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS
  * fix docs
  * don't ignore mbart
  * doc
  * fix mbart fairseq link
  * put mbart before bart
  * apply doc suggestions
* Use hash to clean the test dirs (huggingface#6475)
  * Use hash to clean the test dirs
  * Use hash to clean the test dirs
  * Use hash to clean the test dirs
  * fix
* [EncoderDecoder] Add Cross Attention for GPT2 (huggingface#6415)
  * add cross attention layers for gpt2
  * make gpt2 cross attention work
  * finish bert2gpt2
  * add explicit comments
  * remove attention mask since not yet supported
  * revert attn mask in pipeline
  * Update src/transformers/modeling_gpt2.py
  * Update src/transformers/modeling_encoder_decoder.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Sort unique_no_split_tokens to make it deterministic (huggingface#6461)
  * change unique_no_split_tokens's type to set
  * use sorted list instead of set
  * style
* Import accuracy_score (huggingface#6480)
* Apply suggestions from code review
* Address comments
* Styling
* Generation doc
* Apply suggestions from code review
* Address comments
* Styling
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
Co-authored-by: gijswijnholds <gijswijnholds@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
With the bug introduced, we are currently taking two optimizer steps per batch: a global one, where `xm.optimizer_step` injects a cross-replica sum (CRS) across all cores in training, and a second, local one without it. This has been hurting training accuracy (for example, XLNet GLUE on MNLI is not converging).
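A minimal sketch of the intended PyTorch/XLA pattern, with a toy model standing in for the real training loop:

```python
import torch
import torch_xla.core.xla_model as xm

device = xm.xla_device()
model = torch.nn.Linear(10, 2).to(device)  # stand-in for the real model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

loss = model(torch.randn(4, 10).to(device)).sum()
loss.backward()

# Correct: xm.optimizer_step all-reduces (CRS) the gradients across cores
# and then calls optimizer.step() itself -- exactly one update per batch.
xm.optimizer_step(optimizer)

# The bug: a second, local optimizer.step() afterwards applied an extra,
# un-reduced update per batch, which is what broke convergence.
# optimizer.step()  # <- must NOT be called again
optimizer.zero_grad()
```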
…el cards (huggingface#6527)
Co-authored-by: Fabio Souza <fabiosouza@neuralmind.ai>
* Add Model Card for electra-base-german-uncased
* Update README.md
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
- remove invalid `ENV_` prefix
- add a few ':' while at it
* Logging
* Style
* hf_logging > utils.logging
* Address @thomwolf's comments
* Update test
* Update src/transformers/benchmark/benchmark_utils.py
* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add tf graph compile tests
* fix conflict
* remove more tf transpose statements
* fix conflicts
* fix comment typos
* move function to class function
* fix black
* fix black
* make style
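A hedged sketch of what such a graph-compile test checks, assuming a small randomly initialized model (the config values are illustrative):

```python
import tensorflow as tf
from transformers import BertConfig, TFBertModel

# Tiny random model; real tests use the library's test configs.
model = TFBertModel(BertConfig(num_hidden_layers=2, hidden_size=64,
                               num_attention_heads=2, intermediate_size=128))
input_ids = tf.constant([[7, 6, 5, 4, 3]])

@tf.function  # tracing fails if the model uses eager-only ops (e.g. .numpy())
def run(ids):
    return model(ids)

outputs = run(input_ids)  # a successful trace means the model is graph-compatible
```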
* add xlm-roberta-large-xnli model card * update pt example * typo
* Added model cards for 4 models
  Added model cards for:
  - roberta-base-bulgarian
  - roberta-base-bulgarian-pos
  - roberta-small-bulgarian
  - roberta-small-bulgarian-pos
* fixed link text
* Update README.md
* Create README.md
* removed trailing bracket
* Add language metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
…ingface#6727)
* added model card for codeswitch-spaeng-sentiment-analysis-lince model; also updated other model cards
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* fixed typo
* Update README.md
* Create README.md * Update README.md
* Create README.md * Update README.md
* Create README.md * Update README.md
* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to Adam.
* update PR fixes, add basic test
* bug -- incorrect params in test
* bugfix -- import Adafactor into test
* bugfix -- removed accidental T5 include
* resetting T5 to master
* bugfix -- include Adafactor in __init__
* longer loop for adafactor test
* remove double error class declare
* lint
* black
* isort
* Update src/transformers/optimization.py
* single docstring
* Cleanup docstring
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
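A minimal usage sketch of the ported optimizer; the toy model and hyperparameters are illustrative, not prescriptive:

```python
import torch
from transformers import Adafactor

# Toy model stands in for T5. In relative-step mode the learning rate is
# computed internally, hence lr=None -- the memory-saving configuration
# the commit describes.
model = torch.nn.Linear(10, 2)
optimizer = Adafactor(model.parameters(), lr=None,
                      scale_parameter=True, relative_step=True, warmup_init=True)

loss = model(torch.randn(4, 10)).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```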
…ingface#6713)
* Align the TF NER example with the PT one
* Fix Dataset call
* Fix gradient accumulation training
* Apply style
* Address Sylvain's comments
* Address Sylvain's comments
* Apply style
huggingface#6523)
* [testing] replace hardcoded paths to allow running tests from anywhere
* fix the merge conflict
…e#6429)
* [test schedulers] small improvement
* cleanup
* [doc] multiple corrections to "Summary of the tasks"
* add a new "docs" target to validate docs and document it
* fix mixup