[ `gemma`] Adds support for Gemma 💎 #29167

ArthurZucker · 2024-02-21T12:31:00Z

What does this PR do?

Adds support for Gemma 💎

…model-addition into add-golden-gate

…model-addition into HEAD

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

younesbelkada

Huge work !

… into add-golden-gate

HuggingFaceDocBuilderDev · 2024-02-21T13:21:21Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* inital commit * update * update conversion checkpoint * update conversion script * nits * some fixes * nits * merge * fix permute * nits * fix * nits * nits * nits * fix rope * fix both rope * nites * style * make sure flax works * fix flax init code * fix foward * nits * print flax generation out * current code * nits * SIIIIIIIIIIIIIIIIIII * update * add new tokenizer * correct fast tokenizer * fix conversion * more comments * fix modeling and conversion * nits and nits * nits testing * add some tokenization tests * add some edge cases * add slow tests and fix them * fixup * fix copies for modeling * fix copies * add 7B slow tests * fix * fix * fix tests * make tokenizer cis go green * styling * last tokenizer nits * update jax tests * fix flax for 7b * add jit testing 🤗 * cleanups * isolated nit, inv_freq for rotary_emb.inv_freq * propagate to jax * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * adjust test * fix conversion script * change name * correct file names * update conversion script * Fix bos and eos token ids in the model configuration (#3) * update modelling * update conversion script * add static cache for gemma * fix sdpa generate * fix batched * multiple fixes * fix FA2 * final fix * Rename a few missing strings and filenames (#4) * merge with upstream main * fix copies * fix copies * fix fixup * fix fixup * fix * fix * final tests * fix fx gemma tests * fix fx bf16/fp16 tests * update slow fx tests * fx slow tests: one logits, one generation * move jit test standalone * Apply suggestions from code review * nits * tokenizer updates * more tokenization updates: custom GemmaSentencepieceExtrator * style * Update src/transformers/cache_utils.py * Update src/transformers/models/gemma/__init__.py * Update tests/models/gemma/test_modeling_flax_gemma.py * small nits * style * update tokenization test * fix the rotary embedding * with style * fix slow tests * WARNING this commit might be very important for precisions * Update tests/models/gemma/test_modeling_flax_gemma.py * Update src/transformers/models/gemma/configuration_gemma.py Co-authored-by: Lysandre Debut <hi@lysand.re> * Update src/transformers/models/gemma/modeling_flax_gemma.py Co-authored-by: Lysandre Debut <hi@lysand.re> * small nits here and there! * forgotten nit * remove on the fly computation of inv_freq * revert previous change, let's be safe and for now re-compute freq cis to make sure it's in float * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update src/transformers/models/gemma/convert_gemma_weights_to_hf.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_flax_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_tokenization_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_tokenization_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_tokenization_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_tokenization_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/models/gemma/test_modeling_gemma.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * nit conversion script link * fix some tests * add not doctest and pr doctest * repo consistency * fix last CIs 🚀 * update all readmes --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: sanchit-gandhi <sanchit@huggingface.co> Co-authored-by: Lysandre Debut <hi@lysand.re>

ArthurZucker and others added 30 commits January 21, 2024 20:36

inital commit

7434ea2

update

6165925

update conversion checkpoint

69006fa

update conversion script

cb80199

nits

08252ce

some fixes

32ea5fb

nits

888ab95

Merge branch 'add-golden-gate' of https://github.com/huggingface/new-…

ce0aa57

…model-addition into add-golden-gate

merge

e303e48

fix permute

aefc4bc

nits

78de9f5

Merge branch 'add-golden-gate' of https://github.com/huggingface/new-…

fb2917d

…model-addition into HEAD

fix

f3ad1b8

nits

e3a0bbd

nits

bbee069

nits

e0e0646

fix rope

5f63b4c

fix both rope

4734805

merge

9b27edd

nites

d20574d

style

3f76b2f

make sure flax works

3449b4b

fix flax init code

fbbe149

fix foward

bf8ed52

nits

8273aa7

print flax generation out

cf6345c

current code

6f49e21

nits

e2b09ee

SIIIIIIIIIIIIIIIIIII

94e1020

update

afd89e8

younesbelkada and others added 12 commits February 21, 2024 13:24

Update tests/models/gemma/test_modeling_gemma.py

d83e098

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

09717b6

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_tokenization_gemma.py

cce69c0

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_tokenization_gemma.py

dde30a5

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_tokenization_gemma.py

02a2d38

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_tokenization_gemma.py

c975dae

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

5cf4da3

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

f198015

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

ac82e00

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

7253c9f

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

Update tests/models/gemma/test_modeling_gemma.py

bc7ebaf

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

nit conversion script link

4ad0d52

younesbelkada approved these changes Feb 21, 2024

View reviewed changes

younesbelkada and others added 5 commits February 21, 2024 12:46

fix some tests

7db13fa

add not doctest and pr doctest

60f8ba6

Merge branch 'add-golden-gate' of github.com:huggingface/transformers…

1bf51b3

… into add-golden-gate

repo consistency

2a4d326

fix last CIs 🚀

ea9eb10

ArthurZucker marked this pull request as ready for review February 21, 2024 13:00

ArthurZucker added the New model label Feb 21, 2024

update all readmes

556f743

ArthurZucker merged commit 594c127 into main Feb 21, 2024
2 of 6 checks passed

ArthurZucker deleted the add-golden-gate branch February 21, 2024 13:21

carmocca mentioned this pull request Feb 21, 2024

Support Gemma Lightning-AI/litgpt#940

Closed

gante mentioned this pull request Feb 26, 2024

tracker: generate compatibility with torch.compile #28981

Open

32 tasks

khipp mentioned this pull request Mar 10, 2024

Add missing localized READMEs to the copies check #29575

Merged

1 task

warner-benjamin mentioned this pull request Apr 4, 2024

Llama uses significantly more memory in 4.38 & 4.39 than 4.37 with identical code #30010

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ `gemma`] Adds support for Gemma 💎 #29167

[ `gemma`] Adds support for Gemma 💎 #29167

ArthurZucker commented Feb 21, 2024

younesbelkada left a comment

HuggingFaceDocBuilderDev commented Feb 21, 2024

[ gemma] Adds support for Gemma 💎 #29167

[ gemma] Adds support for Gemma 💎 #29167

Conversation

ArthurZucker commented Feb 21, 2024

What does this PR do?

younesbelkada left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Feb 21, 2024

[ `gemma`] Adds support for Gemma 💎 #29167

[ `gemma`] Adds support for Gemma 💎 #29167