Add DeBERTa #1295

Merged
jeswan merged 15 commits into js/feature/easy_add_model from js/feature/add_deberta on Apr 23, 2021

Conversation

@jeswan (Collaborator) commented Mar 19, 2021

This PR adds DeBERTa V2 support to jiant. It also includes documentation for how to add a model with the latest changes in js/feature/easy_add_model.
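For context, here is a minimal sketch of how a DeBERTa V2 checkpoint might be run through jiant's simple API once this support is in place. The checkpoint name and the configuration field names below are illustrative assumptions, not something specified in this PR:

```python
# Illustrative sketch only: the parameter names and checkpoint identifier are
# assumptions; consult the jiant documentation for the exact simple API.
import jiant.proj.simple.runscript as simple_run

args = simple_run.RunConfiguration(
    run_name="deberta_v2_mrpc",
    exp_dir="/tmp/exp",
    data_dir="/tmp/exp/tasks",
    hf_pretrained_model_name_or_path="microsoft/deberta-v2-xlarge",  # assumed checkpoint name
    tasks="mrpc",
    train_batch_size=16,
    num_train_epochs=3,
)
simple_run.run_simple(args)
```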

Resolved review threads (outdated): jiant/shared/model_resolution.py, jiant/proj/main/modeling/primary.py
@jeswan force-pushed the js/feature/add_deberta branch from 659405c to 9f587ce on April 8, 2021 15:48

codecov bot commented Apr 8, 2021

Codecov Report

Merging #1295 (046e4bb) into js/feature/easy_add_model (4ddc5ac) will decrease coverage by 0.17%.
The diff coverage is 48.27%.

❗ Current head 046e4bb differs from pull request most recent head d2d4894. Consider uploading reports for the commit d2d4894 to get more accurate results
Impacted file tree graph

@@                      Coverage Diff                      @@
##           js/feature/easy_add_model    #1295      +/-   ##
=============================================================
- Coverage                      49.83%   49.66%   -0.18%     
=============================================================
  Files                            162      162              
  Lines                          11170    11191      +21     
=============================================================
- Hits                            5567     5558       -9     
- Misses                          5603     5633      +30     
| Impacted Files | Coverage Δ |
|---|---|
| jiant/proj/main/modeling/model_setup.py | 22.76% <0.00%> (-0.38%) ⬇️ |
| jiant/proj/main/modeling/primary.py | 52.25% <40.90%> (-1.18%) ⬇️ |
| jiant/shared/model_resolution.py | 78.37% <100.00%> (+0.60%) ⬆️ |
| jiant/proj/main/export_model.py | 41.37% <0.00%> (-48.28%) ⬇️ |
| jiant/utils/python/io.py | 52.72% <0.00%> (-5.46%) ⬇️ |

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4ddc5ac...d2d4894. Read the comment docs.

@jeswan force-pushed the js/feature/add_deberta branch from d9f8cb4 to 046e4bb on April 8, 2021 17:58
@jeswan marked this pull request as ready for review on April 8, 2021 18:00
@jeswan requested a review from HaokunLiu as a code owner on April 8, 2021 18:00
@zphang (Collaborator) left a comment

Minor documentation comments.

@@ -160,6 +160,9 @@ def load_encoder_from_transformers_weights(
    for k, v in weights_dict.items():
        if k.startswith(encoder_prefix):
            load_weights_dict[strings.remove_prefix(k, encoder_prefix)] = v
        elif k.startswith(encoder_prefix.split("-")[0]):
            # workaround for deberta-v2

Can you add more detail for this comment?
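For readers of this thread, here is a small illustration of what the fallback branch appears to handle; the key and prefix values are assumptions based on the surrounding diff, not confirmed in this PR. The jiant-side prefix seems to be derived from the model-type string ("deberta-v2."), while the Hugging Face checkpoint keys use the shorter base prefix ("deberta."), so the exact-prefix check misses them:

```python
# Hypothetical example: the key and prefix strings are assumptions for illustration.
weights_dict = {
    "deberta.embeddings.word_embeddings.weight": None,  # assumed HF-style key
}
encoder_prefix = "deberta-v2."  # assumed jiant-side prefix derived from the model type

for k in weights_dict:
    print(k.startswith(encoder_prefix))                 # False: exact prefix misses the key
    print(k.startswith(encoder_prefix.split("-")[0]))   # True: "deberta" matches, so the workaround branch fires
```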

@@ -0,0 +1,70 @@
# Adding a model

`jiant` supports or can easily be exteneded to support Hugging Face Hugging Face's [Transformer models](https://huggingface.co/transformers/viewer/) since `jiant` utilizes [Auto Classes](https://huggingface.co/transformers/model_doc/auto.html) to determine the architecture of the model used based on the name of the [pretrained model](https://huggingface.co/models). To add a model not currently supported in `jiant`, follow the following steps:

Typos: "exteneded", "Hugging Face Hugging Face's"

Maybe add a clarifying explanation (and let me know if this is wrong): We do use AutoModels to resolve the model in jiant, but in order to ensure the jiant pipeline works correctly (e.g. matching the correct tokenizer) and to deal with some subtle differences between models, jiant needs to know some additional information/supports specific handling for specific models, so some additional steps are needed to set up a new model from HF in jiant.
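As a concrete illustration of the Auto Class resolution described above (the checkpoint name is just an example, not prescribed by the guide):

```python
# Example only: any Hugging Face checkpoint name could be used here.
from transformers import AutoConfig, AutoModel, AutoTokenizer

model_name = "microsoft/deberta-v2-xlarge"
config = AutoConfig.from_pretrained(model_name)       # resolves to DebertaV2Config
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)         # resolves to DebertaV2Model
print(type(model).__name__)
```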

```
class DebertaV2MLMHead(BaseMLMHead):
    ...
```


Conclude with "you should now be able to ..."
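To make the quoted `DebertaV2MLMHead` snippet above more concrete, here is a minimal standalone sketch. `BaseMLMHead`'s real interface lives in jiant, so this version subclasses `torch.nn.Module`, and the layer layout is an assumption rather than the code merged in this PR:

```python
import torch
import torch.nn as nn

class DebertaV2MLMHead(nn.Module):
    """Illustrative MLM head: projects hidden states back to vocabulary logits."""

    def __init__(self, hidden_size: int, vocab_size: int, layer_norm_eps: float = 1e-7):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.layer_norm = nn.LayerNorm(hidden_size, eps=layer_norm_eps)
        self.decoder = nn.Linear(hidden_size, vocab_size)

    def forward(self, unpooled: torch.Tensor) -> torch.Tensor:
        x = nn.functional.gelu(self.dense(unpooled))
        x = self.layer_norm(x)
        return self.decoder(x)  # [batch, seq_len, vocab_size] logits
```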

@jeswan merged commit e8536a9 into js/feature/easy_add_model on Apr 23, 2021
@jeswan deleted the js/feature/add_deberta branch on April 23, 2021 17:12
jeswan added a commit that referenced this pull request May 4, 2021
* Update to Transformers v4.3.3 (#1266)

* use default return_dict in taskmodels and remove hidden state context manager in models.

* return hidden states in output of model wrapper

* Switch to task model/head factories instead of embedded if-else statements (#1268)

* Use jiant transformers model wrapper instead of if-else. Use taskmodel and head factory instead of if-else.

* switch to ModelArchitectures enum instead of strings

* Refactor get_output_from_encoder() to be member of JiantTaskModel (#1283)

* refactor getting output from encoder to be member function of jiant model

* switch to explicit encode() in jiant transformers model

* fix simple runscript test

* update to tokenizer 0.10.1

* Add tests for flat_strip() (#1289)

* add flat_strip test

* add list to test cases flat_strip

* mlm_weights(), feat_spec(), flat_strip() if-else refactors (#1288)

* moves remaining if-else statements to jiant model or replaces with model agnostic method

* switch from jiant_transformers_model to encoder

* fix bug in flat_strip()

* Move tokenization logic to central JiantModelTransformers method (#1290)

* move model specific tokenization logic to JiantTransformerModels

* implement abstract methods for JiantTransformerModels

* fix tasks circular import (#1296)

* Add DeBERTa (#1295)

* Add DeBERTa with sanity test

* fix tasks circular import

* [WIP] add deberta tests

* Revert "fix tasks circular import"

This reverts commit f924640.

* deberta tests passing with transformers 6472d8

* switch to deberta-v2

* fix get_mlm_weights_dict() for deberta-v2

* update to transformers 4.5.0

* mark deberta test_export as slow

* Update test_tokenization_normalization.py

* add guide to add a model

* fix test_export_model tests

* minor pytest fixes (add num_labels for rte, overnight flag fix)

* bugfix for simple api notebook

* bugfix for #1310

* bugfix for #1306: simple api notebook path name

* squad running

* 2nd bugfix for #1310: not all tasks have num_labels property

* simple api notebook back to roberta-base

* run test matrix for more steps to compare to master

* save last/best model test fix

Co-authored-by: Jesse Swanson <js11133@nyu.edu>