Weights not initialized from pretrained model #180

lemonhu · 2019-01-11T06:03:47Z

Thanks for your awesome work!

When I execute the following code for a named entity recognition tasks:
model = BertForTokenClassification.from_pretrained("bert-base-uncased", num_labels=num_labels)

Output the following information:

Weights of BertForTokenClassification not initialized from pretrained model: ['classifier.weight', 'classifier.bias']
Weights from pretrained model not used in BertForTokenClassification: ['cls.predictions.bias', 'cls.predictions.transform.dense.weight', 'cls.predictions.transform.dense.bias', 'cls.predictions.decoder.weight', 'cls.seq_relationship.weight', 'cls.seq_relationship.bias', 'cls.predictions.transform.LayerNorm.weight', 'cls.predictions.transform.LayerNorm.bias']

What puzzles me is that the parameters of the classifier are not initialized.

The text was updated successfully, but these errors were encountered:

rodgzilla · 2019-01-11T07:10:27Z

Hi!

Those messages are correct, the pretrained weights that have been released by Google Brain are just the ones of the core network. They did not release task specific weights. To get a model that solves a specific classification task, you would have to train one yourself or get it from someone else.

@thomwolf There have been multiple issues about this specific behavior, maybe we should add some kind of text either as a print while loading the model or in the documentation. I would be happy to do it. What would you prefer?

lemonhu · 2019-01-11T07:39:40Z

Oh, I see, I will train the model with my own dataset, thank you for your answer.

thomwolf · 2019-01-14T09:08:01Z

Yes you are right @rodgzilla we should detail a bit the messages in modeling.py to say that These weights will be trained from scratch.

Co-authored-by: LRL-ModelCloud <lrl@modelcloud.ai>

thomwolf closed this as completed Jan 14, 2019

jplehmann mentioned this issue Mar 5, 2019

Why the weights are not intialized ? #339

Closed

BramVanroy mentioned this issue Dec 21, 2019

bias weights not used in T5Model #2253

Closed

ZYC-ModelCloud pushed a commit to ZYC-ModelCloud/transformers that referenced this issue Nov 14, 2024

cleanup (huggingface#180)

d866208

Co-authored-by: LRL-ModelCloud <lrl@modelcloud.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weights not initialized from pretrained model #180

Weights not initialized from pretrained model #180

lemonhu commented Jan 11, 2019 •

edited

Loading

rodgzilla commented Jan 11, 2019 •

edited

Loading

lemonhu commented Jan 11, 2019

thomwolf commented Jan 14, 2019

Weights not initialized from pretrained model #180

Weights not initialized from pretrained model #180

Comments

lemonhu commented Jan 11, 2019 • edited Loading

rodgzilla commented Jan 11, 2019 • edited Loading

lemonhu commented Jan 11, 2019

thomwolf commented Jan 14, 2019

lemonhu commented Jan 11, 2019 •

edited

Loading

rodgzilla commented Jan 11, 2019 •

edited

Loading