Gemma capping #34282

Merged 58 commits on Nov 19, 2024
Commits
85d549a
softcapping
ArthurZucker Jun 28, 2024
eba5191
soft cap before the mask
ArthurZucker Jun 28, 2024
b9e4a54
style
ArthurZucker Jun 28, 2024
514a839
...
ArthurZucker Jun 28, 2024
7544feb
super nit
ArthurZucker Jun 28, 2024
be1b8c3
update
ArthurZucker Oct 21, 2024
0e0511f
fixes
ArthurZucker Oct 21, 2024
03ccc22
update
ArthurZucker Oct 21, 2024
bdda724
small issue with modular
ArthurZucker Oct 21, 2024
a2b6b12
fix modular imports
ArthurZucker Oct 21, 2024
9365c1b
update
ArthurZucker Oct 21, 2024
2108ee3
fixup
ArthurZucker Oct 21, 2024
520120a
simplify a hell lot
ArthurZucker Oct 21, 2024
314ed1f
simplify cleaning imports
ArthurZucker Oct 22, 2024
8830473
finish fixing
ArthurZucker Oct 22, 2024
e4c19d7
update our design
ArthurZucker Oct 22, 2024
7922210
nits
ArthurZucker Oct 22, 2024
fa1319d
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker Nov 1, 2024
43c68f6
use a deprecation cycle
ArthurZucker Nov 1, 2024
1aec944
updates
ArthurZucker Nov 1, 2024
93b53ef
Fix modular (recursive deps need to always be computed after merges!)
Cyrilvallez Nov 1, 2024
6f3cabb
Merge branch 'gemma-capping' of github.com:huggingface/transformers i…
ArthurZucker Nov 1, 2024
a79c4a9
push
ArthurZucker Nov 1, 2024
4c6d299
fix
ArthurZucker Nov 1, 2024
607c45d
update
ArthurZucker Nov 1, 2024
4598bba
fix modular order
Cyrilvallez Nov 1, 2024
5727270
make fix-copies
ArthurZucker Nov 1, 2024
198b4c4
updates
ArthurZucker Nov 1, 2024
3d35151
update
ArthurZucker Nov 1, 2024
da050cd
?
ArthurZucker Nov 1, 2024
e02078c
don't compile for now
ArthurZucker Nov 1, 2024
5861bbf
?
ArthurZucker Nov 4, 2024
8c47da2
fix some stuff
ArthurZucker Nov 4, 2024
09a88d9
donc!
ArthurZucker Nov 4, 2024
c06b530
fix copies
ArthurZucker Nov 4, 2024
89e6f85
update
ArthurZucker Nov 4, 2024
152e0b7
fixup
ArthurZucker Nov 4, 2024
46d8fa7
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker Nov 4, 2024
006e869
?
ArthurZucker Nov 4, 2024
159c65a
fix two tests
ArthurZucker Nov 4, 2024
56ea5b9
fix?
ArthurZucker Nov 4, 2024
4c3deb9
for now, don't use head info
ArthurZucker Nov 4, 2024
9e3609d
eager when output attentoin and sdpa or flash as it's the simplest be…
ArthurZucker Nov 4, 2024
21edaed
fix-copies
ArthurZucker Nov 4, 2024
b5d9819
revert sdpa check
ArthurZucker Nov 4, 2024
5a3dade
Apply suggestions from code review
ArthurZucker Nov 6, 2024
faf433b
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker Nov 6, 2024
1da75e1
rebase, fix-copies and push
ArthurZucker Nov 6, 2024
aca9120
add a slow integration test
ArthurZucker Nov 6, 2024
8f1fc5e
update the test
ArthurZucker Nov 19, 2024
5be3bab
fix left padding issue
ArthurZucker Nov 19, 2024
3e5b87a
fix test
ArthurZucker Nov 19, 2024
0513aff
remove duplicate scaling
ArthurZucker Nov 19, 2024
480aff8
quality
ArthurZucker Nov 19, 2024
603fce8
Merge branch 'main' into gemma-capping
ArthurZucker Nov 19, 2024
2a765d6
add a small test and make sure it works
ArthurZucker Nov 19, 2024
fb184be
Merge branch 'gemma-capping' of github.com:huggingface/transformers i…
ArthurZucker Nov 19, 2024
6aba68c
2b
ArthurZucker Nov 19, 2024