Merge easy_add_model feature branch #1309

jeswan · 2021-04-23T17:13:47Z

This PR merges a major refactor feature branch making it easy for users to add a Hugging Face Transformers model. This PR adds documentation for adding a model in guides/models/adding_models.md.

This PR closes #1310, #1236, #1306, and #1191

* use default return_dict in taskmodels and remove hidden state context manager in models. * return hidden states in output of model wrapper * update to transformers 4.3.3 * black

…ments (#1268) * Use jiant transformers model wrapper instead of if-else. Use taskmodel and head factory instead of if-else. * switch to ModelArchitectures enum instead of strings

) * refactor getting output from encoder to be member function of jiant model * switch to explicit encode() in jiant transformers model * fix simple runscript test * update to tokenizer 0.10.1

* add flat_strip test * add list to test cases flat_strip

* moves remaining if-else statments to jiant model or replaces with model agnostic method * switch from jiant_transformers_model to encoder * fix bug in flat_strip()

* move model specific tokenization logic to JiantTransformerModels * implement abstract methods for JiantTransformerModels

* Add DeBERTa with sanity test * fix tasks circular import * [WIP] add deberta tests * Revert "fix tasks circular import" This reverts commit f924640. * deberta tests passing with transformers 6472d8 * switch to deberta-v2 * black * flake8 * fix get_mlm_weights_dict() for deberta-v2 * update to transformers 4.5.0 * mark deberta test_export as slow * Update test_tokenization_normalization.py * add guide to add a model

codecov · 2021-04-23T17:22:16Z

Codecov Report

Merging #1309 (d0efb1c) into master (4d0f6a9) will increase coverage by 1.30%.
The diff coverage is 58.39%.

@@            Coverage Diff             @@
##           master    #1309      +/-   ##
==========================================
+ Coverage   48.46%   49.77%   +1.30%     
==========================================
  Files         163      162       -1     
  Lines       11220    11210      -10     
==========================================
+ Hits         5438     5580     +142     
+ Misses       5782     5630     -152

Impacted Files	Coverage Δ
jiant/proj/simple/runscript.py	`44.53% <ø> (ø)`
jiant/tasks/lib/ccg.py	`57.14% <0.00%> (ø)`
jiant/tasks/lib/ropes.py	`20.39% <0.00%> (ø)`
jiant/proj/main/modeling/model_setup.py	`23.01% <17.24%> (+6.26%)`	⬆️
jiant/tasks/lib/templates/squad_style/core.py	`30.20% <20.00%> (-0.51%)`	⬇️
jiant/tasks/evaluate/core.py	`33.33% <41.37%> (+0.43%)`	⬆️
jiant/tasks/lib/rte.py	`78.12% <50.00%> (-0.91%)`	⬇️
jiant/utils/python/datastructures.py	`72.41% <50.00%> (-2.79%)`	⬇️
jiant/proj/main/modeling/taskmodels.py	`33.68% <50.54%> (+8.56%)`	⬆️
jiant/proj/main/modeling/primary.py	`52.25% <56.92%> (+23.21%)`	⬆️
... and 11 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4d0f6a9...d0efb1c. Read the comment docs.

jiant/proj/main/modeling/heads.py

jeswan added 8 commits February 25, 2021 10:14

Update to Transformers v4.3.3 (#1266)

b2cfb2a

* use default return_dict in taskmodels and remove hidden state context manager in models. * return hidden states in output of model wrapper * update to transformers 4.3.3 * black

Switch to task model/head factories instead of embedded if-else state…

e004af8

…ments (#1268) * Use jiant transformers model wrapper instead of if-else. Use taskmodel and head factory instead of if-else. * switch to ModelArchitectures enum instead of strings

Refactor get_output_from_encoder() to be member of JiantTaskModel (#1283

18caa40

) * refactor getting output from encoder to be member function of jiant model * switch to explicit encode() in jiant transformers model * fix simple runscript test * update to tokenizer 0.10.1

Add tests for flat_strip() (#1289)

c9c0410

* add flat_strip test * add list to test cases flat_strip

mlm_weights(), feat_spec(), flat_strip() if-else refactors (#1288)

f027a59

* moves remaining if-else statments to jiant model or replaces with model agnostic method * switch from jiant_transformers_model to encoder * fix bug in flat_strip()

Move tokenization logic to central JiantModelTransformers method (#1290)

9a7aa78

* move model specific tokenization logic to JiantTransformerModels * implement abstract methods for JiantTransformerModels

fix tasks circular import (#1296)

4ddc5ac

jeswan force-pushed the js/feature/easy_add_model branch from 04facd0 to 78647a9 Compare April 23, 2021 17:18

black

6a4da4f

jeswan force-pushed the js/feature/easy_add_model branch from 78647a9 to 6a4da4f Compare April 23, 2021 17:20

jeswan and others added 5 commits April 24, 2021 12:00

Merge branch 'master' into js/feature/easy_add_model

c2f5152

fix test_expor_model tests

b03f47c

minor pytest fixes (add num_labels for rte, overnight flag fix)

4b825ac

bugfix for simple api notebook

4733ef5

bugfix for #1310

782d216

jeswan marked this pull request as ready for review April 26, 2021 16:23

jeswan requested a review from zphang as a code owner April 26, 2021 16:23

bugfix for #1306: simple api notebook path name

b94bf17

zphang reviewed Apr 28, 2021

View reviewed changes

jiant/proj/main/modeling/heads.py Show resolved Hide resolved

squad running

c06267e

jeswan force-pushed the js/feature/easy_add_model branch from e406685 to c06267e Compare April 29, 2021 01:40

Jesse Swanson added 4 commits April 28, 2021 21:48

2nd bugfix for #1310: not all tasks have num_labels property

b9a3af4

simple api notebook back to roberta-base

8c440ab

run test matrix for more steps to compare to master

cd1c67c

save last/best model test fix

d0efb1c

zphang approved these changes May 4, 2021

View reviewed changes

jiant/proj/main/modeling/heads.py Show resolved Hide resolved

jeswan merged commit de5437a into master May 4, 2021

jeswan deleted the js/feature/easy_add_model branch May 4, 2021 17:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge easy_add_model feature branch #1309

Merge easy_add_model feature branch #1309

jeswan commented Apr 23, 2021 •

edited

Loading

codecov bot commented Apr 23, 2021 •

edited

Loading

Merge easy_add_model feature branch #1309

Merge easy_add_model feature branch #1309

Conversation

jeswan commented Apr 23, 2021 • edited Loading

codecov bot commented Apr 23, 2021 • edited Loading

Codecov Report

jeswan commented Apr 23, 2021 •

edited

Loading

codecov bot commented Apr 23, 2021 •

edited

Loading