Model dimension issue #100
Comments
I ran into the same situation. How can it be solved?

Same here, how do I fix it?

Add the parameter to those two lines.

No need to touch the model's config file.
With n_positions=513 in the gpt2 model config, loading fails with:

size mismatch for transformer.h.0.attn.bias: copying a param with shape torch.Size([1, 1, 512, 512]) from checkpoint, the shape in current model is torch.Size([1, 1, 513, 513]).

After changing it to 512, if use_gpt2=True, it instead fails with:

size mismatch for transformer.wpe.weight: copying a param with shape torch.Size([513, 768]) from checkpoint, the shape in current model is torch.Size([512, 768]).
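One way to work around the wpe mismatch is to trim the checkpoint's positional-embedding table to the model's size before calling load_state_dict (note that load_state_dict(strict=False) only skips missing or unexpected keys; it does not bypass shape mismatches). A minimal sketch, assuming a plain PyTorch state dict; fit_position_embeddings and the hard-coded 513/768 sizes are illustrative, not from the repo:

```python
import torch

def fit_position_embeddings(state_dict, n_positions, key="transformer.wpe.weight"):
    # Truncate the checkpoint's positional-embedding table so its first
    # dimension matches the model's n_positions; extra trailing rows
    # (positions the smaller model will never index) are dropped.
    wpe = state_dict[key]
    if wpe.size(0) > n_positions:
        state_dict[key] = wpe[:n_positions].clone()
    return state_dict

# The checkpoint in this issue stores 513 position rows (hidden size 768).
sd = {"transformer.wpe.weight": torch.randn(513, 768)}
sd = fit_position_embeddings(sd, 512)
print(sd["transformer.wpe.weight"].shape)  # torch.Size([512, 768])
```

The pair of errors also hints that the checkpoint was saved with n_ctx=512 (which sizes the attn.bias causal-mask buffer) but n_positions=513 (which sizes wpe); older transformers versions keep these as separate GPT2Config fields, so setting both fields to the checkpoint's values may avoid any trimming at all.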