Language modeling flex heads #210

Merged · calpt merged 14 commits from dev/lm_heads into adapter-hub:master on Aug 24, 2021

Conversation

calpt (Member) commented on Jul 23, 2021

Closes #53.

Waiting for #208.


This PR adds three different prediction heads for XModelWithHeads classes, depending on the model architecture (a usage sketch follows the list):

  • add_causal_lm_head() adds a causal LM head for classes that support this type of head in transformers, e.g. GPT-2, BERT, ...
  • add_masked_lm_head() adds a masked LM head for models with MLM, e.g. BERT, RoBERTa, ...
  • add_seq2seq_lm_head() adds a sequence-to-sequence LM head for encoder-decoder models, e.g. BART
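A minimal usage sketch of the three methods. The method names are taken from this PR; the checkpoint identifiers, head names, and the assumption that the ModelWithHeads classes are importable from the adapter-transformers `transformers` namespace are illustrative:

```python
# Minimal sketch, assuming adapter-transformers exposes the flex-head classes
# under the transformers namespace; checkpoints and head names are illustrative.
from transformers import (
    BartModelWithHeads,
    BertModelWithHeads,
    GPT2ModelWithHeads,
)

# Causal LM head on a decoder-only model
gpt2 = GPT2ModelWithHeads.from_pretrained("gpt2")
gpt2.add_causal_lm_head("clm_head")

# Masked LM head on an encoder model
bert = BertModelWithHeads.from_pretrained("bert-base-uncased")
bert.add_masked_lm_head("mlm_head")

# Sequence-to-sequence LM head on an encoder-decoder model
bart = BartModelWithHeads.from_pretrained("facebook/bart-base")
bart.add_seq2seq_lm_head("s2s_head")
```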

All heads can be automatically converted from their respective static-head counterparts (e.g. seq2seqlm from BartForConditionalGeneration).
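A sketch of how this conversion could be exercised; the save/load round trip below is an assumed workflow, not code from the PR:

```python
# Sketch under the assumption that loading a static-head checkpoint into a
# flex-head class performs the head conversion; paths are illustrative.
from transformers import BartForConditionalGeneration, BartModelWithHeads

static_model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")
static_model.save_pretrained("./bart-static")

# The static seq2seq LM head should be converted into a flex prediction head.
flex_model = BartModelWithHeads.from_pretrained("./bart-static")
print(flex_model.heads.keys())  # expect the converted head to be listed here
```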

To ensure that all conversions work as expected, a new test module was added in test_adapter_conversion.py.
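For illustration only, a hypothetical equivalence check in the spirit of that module might look as follows (the actual tests live in test_adapter_conversion.py; function name, checkpoint, and tolerance are assumptions):

```python
# Hypothetical conversion test, not the PR's actual code: the converted
# flex head should reproduce the static head's logits on the same input.
import torch
from transformers import BartForConditionalGeneration, BartModelWithHeads


def test_seq2seq_lm_head_conversion(tmp_path):
    static_model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")
    static_model.save_pretrained(str(tmp_path))
    flex_model = BartModelWithHeads.from_pretrained(str(tmp_path))

    input_ids = torch.tensor([[0, 100, 200, 2]])
    with torch.no_grad():
        expected = static_model(input_ids).logits
        actual = flex_model(input_ids).logits
    assert torch.allclose(expected, actual, atol=1e-5)
```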

calpt marked this pull request as ready for review on July 27, 2021, 13:27
calpt requested a review from hSterz on August 16, 2021, 16:12
hSterz (Member) left a comment:

Looks good

calpt merged commit 84289df into adapter-hub:master on Aug 24, 2021
calpt deleted the dev/lm_heads branch on August 24, 2021, 08:43
Development

Successfully merging this pull request may close these issues.

Language modeling head for flexible head classes (#53)