Rebase hf #7

vahanhov · 2023-08-11T10:18:34Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

* First draft * More improvements * Convert patch embedding layer * Convert all weights * Make conversion work * Improve conversion script * Fix style * Make all tests pass * Add image processor to auto mapping * Add swiglu ffn * Add image processor to conversion script * Fix conversion of giant model * Fix documentation * Fix style * Fix tests * Address comments * Address more comments * Remove unused arguments * Remove more arguments * Rename parameters * Include mask token * Address comments * Add docstring * Transfer checkpoints * Empty commit

* add llama * add other readmes * update padding id in readme * add link to paper * fix paths and tokenizer * more nits * styling * fit operation in 2 lines when possible * nits * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add form * update reademe * update readme, we don't have a default pad token * update test and tokenization * LLaMA instead of Llama * nits * add expected text * add greeedy output * styling * Update src/transformers/models/llama/modeling_llama.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * sequential device map * skip relevant changes --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update supported Python and PyTorch versions in readme
* Update Python, etc. versions in non-English readmes
  These were more out of date than in the English readme. This updates all the versions the readmes claim the repository is tested with to the same versions stated in the English readme. Those versions are current at least in the case of the Python and PyTorch versions (and less out of date for the others).
* Propagate trailing whitespace fix to model list
  This runs "make fix-copies". The only change is the removal of whitespace. No actual information or wording is changed.
* Update tested TensorFlow to 2.6 in all readmes
  Per pinning in setup.py
  Unlike Python and PyTorch, the minimum supported TensorFlow version has not very recently changed, but old versions were listed in all READMEs.

huggingface#24588) * docs: ko: `document_question_answering.md` * fix: resolve suggestions Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

…ngface#24770) * Add text classification example * set the problem type and finetuning task * ruff reformated * fix bug for unseting label_to_id for regression * update README.md * fixed finetuning task * update comment * check if label exists in feature before removing * add useful logging

…cision_transformer (huggingface#24949) Bump pygments in /examples/research_projects/decision_transformer Bumps [pygments](https://github.com/pygments/pygments) from 2.11.2 to 2.15.0. - [Release notes](https://github.com/pygments/pygments/releases) - [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES) - [Commits](pygments/pygments@2.11.2...2.15.0) --- updated-dependencies: - dependency-name: pygments dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* testing * example script * fix typehinting * some tests * make test * optional update * Union of arguments * does this fix the issue * remove reports * set default to False * documentation change * None support * does not need None * Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (huggingface#24549) * Fix typing annotations for FSDP and DeepSpeed in TrainingArguments * Change dict to Dict * Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments" (huggingface#24574) Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (huggingface#24549)" This reverts commit c5e29d4. * Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (huggingface#24549) * Fix typing annotations for FSDP and DeepSpeed in TrainingArguments * Change dict to Dict * merge * hacky fix * fixup --------- Co-authored-by: Max Ryabinin <mryabinin0@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

…ion_transformer (huggingface#24954) Bump aiohttp in /examples/research_projects/decision_transformer Bumps [aiohttp](https://github.com/aio-libs/aiohttp) from 3.8.1 to 3.8.5. - [Release notes](https://github.com/aio-libs/aiohttp/releases) - [Changelog](https://github.com/aio-libs/aiohttp/blob/v3.8.5/CHANGES.rst) - [Commits](aio-libs/aiohttp@v3.8.1...v3.8.5) --- updated-dependencies: - dependency-name: aiohttp dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* GTPQ integration * Add tests for gptq * support for more quantization model * fix style * typo * fix method * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add dataclass and fix quantization_method * fix doc * Update tests/quantization/gptq/test_gptq.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * modify dataclass * add gtpqconfig import * fix typo * fix tests * remove dataset as req arg * remove tokenizer import * add offload cpu quantization test * fix check dataset * modify dockerfile * protect trainer * style * test for config * add more log * overwrite torch_dtype * draft doc * modify quantization_config docstring * fix class name in docstring * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * more warning * fix 8bit kwargs tests * peft compatibility * remove var * fix is_gptq_quantized * remove is_gptq_quantized * fix wrap * Update src/transformers/modeling_utils.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * add exllama * skip test * overwrite float16 * style * fix skip test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix docsting formatting * add doc * better test --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* novelty debugging * running solution * message passing slightly better * simplified serialize * current code * flamingo inspired * message passing correctly implemented * positions update * removing commented code * causal message passing * edge case in case using another model besides serialize * update message passing and position embedding * Update src/transformers/models/bloom/modeling_bloom.py * removed unnecessary code

* novelty debugging * running solution * message passing slightly better * simplified serialize * current code * flamingo inspired * message passing correctly implemented * positions update * removing commented code * causal message passing * edge case in case using another model besides serialize * update message passing and position embedding * Update src/transformers/models/bloom/modeling_bloom.py * removed unnecessary code * clearer message passing code * Update src/transformers/models/bloom/causal_message_passing.py * Update src/transformers/models/bloom/causal_message_passing.py * Update src/transformers/models/bloom/causal_message_passing.py Co-authored-by: vahanhov <32771381+vahanhov@users.noreply.github.com> --------- Co-authored-by: vahanhov <32771381+vahanhov@users.noreply.github.com>

ydshieh and others added 30 commits July 18, 2023 15:08

Enable ZeroShotAudioClassificationPipelineTests::test_small_model_pt (

57da42a

huggingface#24882) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[InstructBlip] Fix int8/fp4 issues (huggingface#24888)

a9e067a

* fix dtype issue * revert `.float()` * fix copies

[Blip] Fix blip output name (huggingface#24889)

5c5cb4e

* fix blip output name * add property * oops * fix failing test

check if eval dataset is dict (huggingface#24877)

dd49404

* check if eval dataset is dict * formatting

Separate CircleCI cache between main and pull (or other branches) (…

30c172f

…huggingface#24886) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Disable ipex env var if false (huggingface#24885)

a982c02

Disable ipex if in use

Check for accelerate env var when doing CPU only (huggingface#24890)

476be08

Check for use-cpu

Avoid some pipeline tasks to use use_cache=True (huggingface#24893)

129cb6d

* fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix test_model_parallelism for FalconModel (huggingface#24914)

243b2ea

* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fixed issue where ACCELERATE_USE_CPU="False" results in bool(True) (h…

aa4afa6

…uggingface#24907) - This results in cpu mode on Apple Silicon mps
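
The commit above concerns a common Python pitfall: `bool()` applied to any non-empty string, including `"False"`, returns `True`. The following is a minimal sketch of that behaviour and of one way to parse such a flag. The helper `env_flag` and its set of accepted truthy spellings are hypothetical and are not taken from the code changed in huggingface#24907.

```python
import os


def env_flag(name: str, default: bool = False) -> bool:
    """Parse an environment variable as a boolean flag (hypothetical helper)."""
    value = os.environ.get(name)
    if value is None:
        return default
    # Compare against explicit truthy spellings instead of relying on bool(str).
    return value.strip().lower() in {"1", "true", "yes", "y", "on"}


# Simulate a user explicitly disabling CPU mode.
os.environ["ACCELERATE_USE_CPU"] = "False"

# Naive conversion: any non-empty string is truthy, so this is True.
naive = bool(os.environ["ACCELERATE_USE_CPU"])

# Explicit parsing: the string "False" is treated as a disabled flag.
parsed = env_flag("ACCELERATE_USE_CPU")

print(naive, parsed)
```

With the naive conversion, setting the variable to `"False"` still enables CPU mode, which matches the symptom described in the commit message.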

fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST (huggingface#24902)

99c1268

fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST suno/barh should be suno/bark

Fix minor llama2.md model doc typos (huggingface#24909)

3a43794

Update llama2.md Fix typos in the llama2 model doc

[Llama2] replace self.pretraining_tp with `self.config.pretrainin…

ee4250a

…g_tp` (huggingface#24906) * add possibility to disable TP * fixup * adapt from offline discussions

[doc] image_processing_vilt.py wrong default documented (huggingfac…

6112b1c

…e#24931) [doc] image_processing_vilt.py wrong default

Deprecate unused OpenLlama architecture (huggingface#24922)

79444f3

* Resolve typo in check_repo.py * Specify encoding when opening modeling files * Deprecate the OpenLlama architecture * Add disclaimer pointing to Llama I'm open to different wordings here * Match the capitalisation of LLaMA

replace no_cuda with use_cpu in test_pytorch_examples (huggingface#24944

37d8611

) * replace no_cuda with use_cpu in test_pytorch_examples * remove codes that never be used * fix style

Generate: sequence bias can handle same terminations (huggingface#24822)

89136ff

Update processing_vision_text_dual_encoder.py (huggingface#24950)

85514c1

Fixing small typo: kwrags -> kwargs

Fix main_input_name in src/transformers/keras_callbacks.py (huggi…

35c0459

…ngface#24916) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

[DOCS] Example for LogitsProcessor class (huggingface#24848)

0c41765

* make docs * fixup * resolved * remove debugs * Revert "fixup" This reverts commit 5e0f636. * prev (ignore) * fixup broke some files * remove files * reverting modeling_reformer * lang fix

[RWKV] Add Gradient Checkpointing support for RWKV (huggingface#24955)

89a1f34

add GC support for RWKV

Change logic for logging in the examples (huggingface#24956)

aa1b09c

Change logic

NielsRogge and others added 26 commits August 10, 2023 09:13

[DINOv2] Update pooler output (huggingface#25392)

b175fc3

Update pooler output

🌐 [i18n-KO] Translated philosophy.md to Korean (huggingface#25010)

b14d464

* docs: ko: philosophy.md * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions

Doc checks (huggingface#25408)

16edf4d

* Document check_dummies * Type hints and doc in other files * Document check inits * Add documentation to * Address review comments

Generation: strict generation config validation at save time (hugging…

123ad53

…face#25411) * strict gen config save; Add tests * add note that the warning will be an exception in v4.34

[WavLM] Fix Arxiv link and authors (huggingface#25415)

d0839f1

* [WavLM] Fix Arxiv link and authors * make style

Generate: Load generation config when device_map is passed (hugging…

3e41cf1

…face#25413)

Fix rendering for torch.compile() docs (huggingface#25432)

e7b001d

fix rendering

Add examples to tests to run when setup.py is modified (huggingfa…

2d6839e

…ce#25437) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Fix issue with ratio evaluation steps and auto find batch size (huggi…

a7da299

…ngface#25436) * Fully rebased solution * 500

docs: add LLaMA-Efficient-Tuning to awesome-transformers (huggingface…

3470012

…#25441) Co-authored-by: statelesshz <jihuazhong1@huawei.com>

Fix for huggingface#25437 (huggingface#25454)

454957c

* fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

not debugged code

8db3720

reference code so nothing is lost

7fcb1ab

novelty

fe48c3f

added docstrings

e47b6e8

fixed some relative import errors

96d2814

fixed small bugs

1090256

added linear layers to bloom

5beaab7

removed impossible embedding method

c7166f1

Update src/transformers/models/bloom/desequence_graph_ids.py

d7052dd

Co-authored-by: vahanhov <32771381+vahanhov@users.noreply.github.com>

Update src/transformers/models/bloom/desequence_graph_ids.py

591cf9c

Co-authored-by: vahanhov <32771381+vahanhov@users.noreply.github.com>

memory efficient message passing (#4)

13affe7

rebase from HF

4eb3009

vahanhov requested a review from zachares August 11, 2023 10:18

zachares approved these changes Aug 11, 2023

Merge branch 'main' into rebase-hf

c5e232d

zachares merged commit 2871c39 into main Aug 11, 2023
2 checks passed
