Sync/v4.21.2 #404

calpt · 2022-08-19T15:56:14Z

No description provided.

* Feat: add missing type hints for QDQBertModel * fix: ran black and isort * feat: Add missing output type for QDQBertModel * feat: Add type hints for QDQBertLMHeadModel and models starting with QDQBertFor * fix: add missing return type for QDQBertModel * fix: remove wrong return type for QDQBertEmbeddings * fix: readded config argument to load_tf_weights_in_qdqbert * fix: add BertConfig type to BertEmbeddings config due to check error in CI * fix: removed config type hints to avoid copy checks

* add skeleton files * fix cpu inference link * add hint to make clear that single gpu section contains general info * add new files to ToC * update toctree to have subsection for performance * add "coming soon" to the still empty sections * fix missing title * fix typo * add reference to empty documents * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* few fixes: - hardcode tokenizer padding side - remove unused args * few fixes: - added new attribute on TokenizerTesterMixin - added new slow test - remove unused arg on tokenizer class * make style * Update src/transformers/models/bloom/tokenization_bloom_fast.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * make quality * apply changes - remove new attribute - redefine test on the class * add comments Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* wip * rebase * all tests pass * rebase * ready for PR * address comments * fix styles * add require_torch to pipeline test * remove remote image to improve CI consistency * address comments; fix tf/flax tests * address comments; fix tf/flax tests * fix tests; add alias * repo consistency tests * Update src/transformers/pipelines/visual_question_answering.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * address comments * Update src/transformers/pipelines/visual_question_answering.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * merge * wip * wip * wip * most basic tests passes * all tests pass now * relative embedding * wip * running make fixup * remove bert changes * fix doc * fix doc * fix issues * fix doc * address comments * fix CI * remove redundant copied from * address comments * fix broken test Co-authored-by: Sijun He <sijunhe@Sijuns-MacBook-Pro.local> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Improve vision models * Add a lot of improvements * Remove to_2tuple from swin tests * Fix TF Swin * Fix more tests * Fix copies * Improve more models * Fix ViTMAE test * Add channel check for TF models * Add proper channel check for TF models * Apply suggestion from code review * Apply suggestions from code review * Add channel check for Flax models, apply suggestion * Fix bug * Add tests for greyscale images * Add test for interpolation of pos encodings Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* Copied all the changes from the last PR * added in documentation_tests.txt * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/encoder-decoder.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: vishwaspai <vishwas.pai@emplay.net> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance * fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores

* Add CodeGen model * Add missing key and switch order of super() * Fix torch.ones init with uint8 instead of bool * Address comments: copy statements and doc * update tests * remove old model parallel * fix batch gen tests * fix batch gen test * update test_gpt2_sample_max_time * fix codegen test and revert gpt2 test change * Fix incorrect tie_word_embedding value, typo, URL * Fix model order in README and styling * Reorder model list alphabetically * Set tie_word_embedding to False by default * Apply suggestions from code review * Better attn mask name & remove attn masked_bias * add tokenizer for codegen * quality * doc tokenizer * fix-copies * add CodeGenTokenizer in converter * make truncation optional * add test for truncation * add copyright * fix-copies * fix fast tokenizer decode * Update src/transformers/models/codegen/tokenization_codegen.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * increase vocab_size in tests Co-authored-by: patil-suraj <surajp815@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Improve doc test * Improve code example of segmentation model * Apply suggestion * Update src/transformers/models/detr/modeling_detr.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

qherreros and others added 30 commits June 23, 2022 13:26

add doctests for DETR (#17786)

ab223fc

* add: check labels for detr object detection doctests * add: check shapes * add: add detr to documentation_tests.py * fix: make fixup output * fix: add a comment

TF: generate without tf.TensorArray (#17801)

5cce307

Update type hints modeling_yoso.py (#17827)

4297f44

* Update modeling_yoso.py * make fixup * Update modeling_yoso.py That should be it copied from previous PR

change message (#17836)

b2fdbac

Fix properties of unset special tokens in non verbose mode (#17797)

3eed553

Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

Fix an error message in BigBird (#17840)

5bc779a

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix broken test for models with batchnorm (#17841)

1a7ef33

* Fix tests that broke when models used batchnorm * Initializing the model twice does not actually... ...give you the same weights each time. I am good at machine learning. * Fix speed regression

Update modeling_cvt.py (#17846)

e70abda

As shown in the Colab notebook, I added the missing type hints for `CvtForImageClassification` and `CvtModel`.

Change no trainer image_classification test (#17635)

acb709d

* Adjust test arguments and use a new example test

Index RNG states by global rank in saves (#17852)

7c1b912

Properly calculate the total train iterations and recalculate num epochs in no_trainer scripts (#17856)

75259b4

Auto-build Docker images before on-merge if setup.py was changed (#17573)

893ab12

* Auto-build on setup modification * Modify push-caller * Make adjustments based on code review

[tests/VisionEncoderDecoder] import to_2tuple from test utils (#17865)

73a0496

Fix Splinter test (#17854)

4474900

* fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[CodeGen] support device_map="auto" for sharded checkpoints (#17871)

061a73d

Add type hints for gptneox models (#17858)

ef28a40

* feat: Add type hints for GPTNeoxForCausalLM and GPTNeoXModel * fix: removed imported Dict type * fix: Removed unused List import

Fix: torch.utils.checkpoint import error. (#17849)

2ef94ee

Use higher value for hidden_size in Flax BigBird test (#17822)

0e0f1f4

* Use higher value for hidden_size in Flax BigBird test * remove 5e-5 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Skip test_multi_gpu_data_parallel_forward for MaskFormer (#17864)

494aac6

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix test_inference_instance_segmentation_head (#17872)

b03be78

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Properly get tests deps in test_fetcher (#17870)

e8eb699

* Properly get tests deps in test_fetcher * Remove print

CLI: handle multimodal inputs (#17839)

cc5c061

NielsRogge and others added 12 commits July 27, 2022 10:43

[EncoderDecoder] Improve docs (#18271)

36f9859

* Improve docs * Improve docs of speech one as well * Apply suggestions from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

fix loading from pretrained for sharded model with `torch_dtype="auto"` (#18061)

9e564d0

Remove all uses of six (#18318)

3496ea8

* Remove all uses of six * fix quality

sentencepiece shouldn't be required for the fast LayoutXLM tokenizer

31b3a12

Fix sacremoses soft dependency for Transformers XL

0daa202

Add function to the submodule init

Release: v4.21.0

a9eee2f

Fix load of model checkpoints in the Trainer (#18470)

dea58d6

Patch release: v4.21.1

f0d4968

remove files from 'v4.21.1' before merge

cf8a9ca

Merge stripped branch 'v4.21.1'

642d3a7

Style cleanups after sync

6238a99

calpt added the sync label Aug 19, 2022

calpt force-pushed the sync/v4.21.1 branch from 9609f72 to c47c55f Compare August 19, 2022 19:04

Fixes after sync

78504f3

calpt force-pushed the sync/v4.21.1 branch from c47c55f to 78504f3 Compare August 19, 2022 20:09

calpt and others added 4 commits August 24, 2022 15:24

Merge branch 'master' into sync/v4.21.1

46cd423

Accept trust_remote_code and ignore it in `PreTrainedModel.from_pretrained` (#18428)

c5f7df8

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Patch release: v4.21.2

b487096

Currently no fx tracing for adapter models

9513121

calpt force-pushed the sync/v4.21.1 branch from 756cf87 to 9513121 Compare August 25, 2022 10:02

calpt added 3 commits August 26, 2022 15:06

Merge tag 'v4.21.2' into sync/v4.21.1

c9bde0f

Code style

fe85424

Merge branch 'master' into sync/v4.21.1

6873096

calpt changed the title ~~Sync/v4.21.1~~ Sync/v4.21.2 Aug 31, 2022

Fixes for fx compatibility

4f97fc6

calpt force-pushed the sync/v4.21.1 branch from 223f3fd to 4f97fc6 Compare August 31, 2022 19:23

calpt marked this pull request as ready for review September 1, 2022 08:02

calpt merged commit ce69f41 into adapter-hub:master Sep 1, 2022

calpt deleted the sync/v4.21.1 branch September 1, 2022 08:03
