
Add support for XLNet and the new whole-word-masking variant of BERT. #730

Closed
sleepinyourhat opened this issue Jun 24, 2019 · 6 comments
Labels: 0.0.x release-on-fix (Put out a new 0.0.x release when this is fixed.) · help wanted (Extra attention is needed) · high-priority (Fix this before addressing any other major issue.) · jiant-v1-legacy (Relevant to versions <= v1.3.2)

Comments

@sleepinyourhat
Contributor

No description provided.

@sleepinyourhat
Contributor Author

This shouldn't require any code changes, just updating to the next release of pytorch_pretrained_bert once it comes out. We can test it out by pulling pytorch_pretrained_bert at head now.
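
For anyone who wants to try it now, a minimal sketch of what that test could look like, assuming the whole-word-masking checkpoint name from the HuggingFace README:

```python
# Minimal sketch: load a whole-word-masking checkpoint through pytorch_pretrained_bert.
# Install at head first, e.g.:
#   pip install git+https://github.com/huggingface/pytorch-pretrained-BERT.git
# The model identifier below is assumed from the HuggingFace README.
import torch
from pytorch_pretrained_bert import BertModel, BertTokenizer

MODEL = "bert-large-uncased-whole-word-masking"
tokenizer = BertTokenizer.from_pretrained(MODEL)
model = BertModel.from_pretrained(MODEL)
model.eval()

tokens = ["[CLS]"] + tokenizer.tokenize("Testing whole-word masking.") + ["[SEP]"]
input_ids = torch.tensor([tokenizer.convert_tokens_to_ids(tokens)])
with torch.no_grad():
    encoded_layers, pooled = model(input_ids)  # all encoder layers by default
```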

@sleepinyourhat changed the title from "Add support for whole-word" to "Add support for the new whole-word-masking variant of BERT." on Jun 24, 2019
@sleepinyourhat added the help wanted (Extra attention is needed) and high-priority (Fix this before addressing any other major issue.) labels on Jun 26, 2019
@sleepinyourhat
Contributor Author

sleepinyourhat commented Jun 26, 2019

At least the new BERT model is now out in pytorch_pretrained_bert. Now, does anyone see where/how we're installing pytorch_pretrained_bert?

This doesn't need to go along with 1.0, but as soon as the code is stable, we should add it. Frankly, for the vast majority of new experiments, it doesn't make sense to use plain BERT.

@sleepinyourhat changed the title from "Add support for the new whole-word-masking variant of BERT." to "Add support for XLNet and the new whole-word-masking variant of BERT." on Jun 26, 2019
@sleepinyourhat
Contributor Author

It looks like this'll break old task pickles, but it doesn't require any changes to our code.

@W4ngatang
Collaborator

I'm not super familiar with XLNet at a low level; does it use the same BERT modeling tricks (special tokens, concatenating inputs, etc.)?

Our BERT-specific code is, for the most part, very BERT-specific, so at a minimum we'd likely have to rename various variables to incorporate XLNet. I'm more worried about drastic changes; I don't think our code is well-abstracted enough to support swapping in arbitrary pretrained LMs. BERT with whole-word masking seems like it shouldn't break much, if anything at all.
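
For reference, the input conventions do differ: BERT puts its classification token first, while XLNet puts it last. A rough sketch of the two sequence-pair formats, based on the papers rather than anything in our code:

```python
# Rough sketch of the two sequence-pair conventions (from the papers, not jiant code).
def bert_format(tokens_a, tokens_b):
    # BERT: [CLS] A [SEP] B [SEP]
    return ["[CLS]"] + tokens_a + ["[SEP]"] + tokens_b + ["[SEP]"]

def xlnet_format(tokens_a, tokens_b):
    # XLNet: A <sep> B <sep> <cls>  (the classification token comes last)
    return tokens_a + ["<sep>"] + tokens_b + ["<sep>", "<cls>"]
```

So the concatenation trick carries over, but anything that hard-codes [CLS]-first would need to change.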

@sleepinyourhat
Contributor Author

I also misread the HuggingFace readme—XLNet isn't ready yet—so we'll have to wait and see.

I found the requirement, though: we were sneakily installing the HuggingFace repo via AllenNLP. I'll at least try to get whole-word masking set up.

@sleepinyourhat
Contributor Author

The HuggingFace update is out—I'll start looking into adding support...
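
A minimal sketch of what the first step might look like, assuming the package and model names from the new HuggingFace release notes:

```python
# Minimal sketch: load XLNet through the new HuggingFace release.
# The package and model identifiers below are assumed from the release notes.
import torch
from pytorch_transformers import XLNetModel, XLNetTokenizer

MODEL = "xlnet-large-cased"
tokenizer = XLNetTokenizer.from_pretrained(MODEL)
model = XLNetModel.from_pretrained(MODEL)
model.eval()

input_ids = torch.tensor([tokenizer.encode("Testing XLNet support.")])
with torch.no_grad():
    last_hidden_state = model(input_ids)[0]
```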
