[Deberta/Deberta-v2] Refactor code base to support compile, export, and fix LLM #22105

Merged 43 commits on Nov 25, 2024
Commits
65e4535
some modification for roadmap
ArthurZucker Mar 11, 2023
2a02118
revert some changes
ArthurZucker Oct 12, 2024
df9a6c7
yups
ArthurZucker Oct 12, 2024
2462842
Merge branch 'main' of github.com:huggingface/transformers into refac…
ArthurZucker Oct 12, 2024
b217710
weird
ArthurZucker Oct 12, 2024
8e75639
make it work
ArthurZucker Oct 12, 2024
a1fabd9
sttling
ArthurZucker Oct 12, 2024
fbacf77
fix-copies
ArthurZucker Oct 14, 2024
ef96920
fixup
ArthurZucker Oct 14, 2024
1536ebc
renaming
ArthurZucker Oct 14, 2024
b5fc2a4
more fix-copies
ArthurZucker Oct 14, 2024
c243194
move stuff around
ArthurZucker Oct 14, 2024
ad573f7
remove torch script warnings
ArthurZucker Oct 14, 2024
675892f
ignore copies
ArthurZucker Oct 14, 2024
d91bbbc
revert bad changes
ArthurZucker Oct 14, 2024
87b632e
woops
ArthurZucker Oct 14, 2024
166cd8c
just styling
ArthurZucker Oct 14, 2024
92c1428
nit
ArthurZucker Oct 14, 2024
7bd49e7
Merge branch 'main' of github.com:huggingface/transformers into refac…
ArthurZucker Oct 22, 2024
a2f1722
Merge branch 'main' of github.com:huggingface/transformers into refac…
ArthurZucker Oct 24, 2024
a077b24
revert
ArthurZucker Oct 24, 2024
ebba01b
style fixup
ArthurZucker Oct 24, 2024
2531c18
nits configuration style
ArthurZucker Oct 24, 2024
e3d1798
fixup
ArthurZucker Oct 24, 2024
845fef6
nits
ArthurZucker Oct 24, 2024
2daa6fc
will this fix the tf pt issue?
ArthurZucker Oct 24, 2024
e7e70b7
style
ArthurZucker Oct 24, 2024
f5cf86c
???????
ArthurZucker Oct 24, 2024
45893bd
update
ArthurZucker Oct 24, 2024
87f45cc
eval?
ArthurZucker Oct 24, 2024
edc07ca
update error message
ArthurZucker Oct 24, 2024
6608ee5
updates
ArthurZucker Oct 24, 2024
adb0913
style
ArthurZucker Oct 24, 2024
3987edc
grumble grumble
ArthurZucker Oct 24, 2024
c03fdd3
Merge branch 'main' of github.com:huggingface/transformers into refac…
ArthurZucker Oct 24, 2024
c3e1cc0
update
ArthurZucker Nov 20, 2024
4a6690a
style
ArthurZucker Nov 20, 2024
29b78ec
Merge branch 'main' of github.com:huggingface/transformers into refac…
ArthurZucker Nov 20, 2024
b51d64a
nit
ArthurZucker Nov 21, 2024
cb6a11c
skip torch fx tests that were failing
ArthurZucker Nov 25, 2024
15b33a1
style
ArthurZucker Nov 25, 2024
6d0113c
skip the failing tests
ArthurZucker Nov 25, 2024
ccd6f44
skip another test and make style
ArthurZucker Nov 25, 2024
5 changes: 5 additions & 0 deletions src/transformers/models/deberta/configuration_deberta.py
@@ -82,6 +82,9 @@ class DebertaConfig(PretrainedConfig):
`["p2c", "c2p"]`.
layer_norm_eps (`float`, *optional*, defaults to 1e-12):
The epsilon used by the layer normalization layers.
legacy (`bool`, *optional*, defaults to `True`):
Whether or not the model should use the legacy `LegacyDebertaOnlyMLMHead`, which does not work properly
for mask infilling tasks.

Example:

@@ -121,6 +124,7 @@ def __init__(
pos_att_type=None,
pooler_dropout=0,
pooler_hidden_act="gelu",
legacy=True,
**kwargs,
):
super().__init__(**kwargs)
@@ -151,6 +155,7 @@ def __init__(
self.pooler_hidden_size = kwargs.get("pooler_hidden_size", hidden_size)
self.pooler_dropout = pooler_dropout
self.pooler_hidden_act = pooler_hidden_act
self.legacy = legacy


# Copied from transformers.models.deberta_v2.configuration_deberta_v2.DebertaV2OnnxConfig
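
For readers skimming the diff, here is a minimal, self-contained sketch of the pattern these hunks add: a `legacy` flag that defaults to `True` so existing checkpoints keep their current behaviour, and that a model could read to pick its MLM head. This is not the code from the PR. `SketchConfig`, `select_mlm_head`, and the `"NewMLMHead"` name are hypothetical stand-ins; only `legacy`, `pooler_hidden_size`, and `LegacyDebertaOnlyMLMHead` come from the diff above.

```python
# Hypothetical stand-in for the config pattern in the diff; NOT the real DebertaConfig.
class SketchConfig:
    def __init__(
        self,
        hidden_size=768,
        pooler_dropout=0,
        pooler_hidden_act="gelu",
        legacy=True,  # default keeps the pre-existing (legacy) head
        **kwargs,
    ):
        self.hidden_size = hidden_size
        # Mirrors the diff: pooler_hidden_size falls back to hidden_size when not passed.
        self.pooler_hidden_size = kwargs.get("pooler_hidden_size", hidden_size)
        self.pooler_dropout = pooler_dropout
        self.pooler_hidden_act = pooler_hidden_act
        self.legacy = legacy


def select_mlm_head(config):
    """Return the name of the head a model might build from the flag.

    Assumption: configs without a `legacy` attribute are treated as legacy,
    which matches the default of True in the diff.
    """
    if getattr(config, "legacy", True):
        return "LegacyDebertaOnlyMLMHead"
    return "NewMLMHead"  # hypothetical placeholder name, not taken from the PR


default_cfg = SketchConfig()
opt_out_cfg = SketchConfig(legacy=False, pooler_hidden_size=512)
print(select_mlm_head(default_cfg))
print(select_mlm_head(opt_out_cfg))
```

Defaulting to `True` makes the new behaviour opt-in: anyone who needs mask infilling to work would set `legacy=False` explicitly, while everything else keeps loading unchanged.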