Error(s) in loading state_dict for ElectraForTokenClassification #28

lijiayi980130 · 2020-11-21T12:40:18Z

size mismatch for classifier.weight: copying a param with shape torch.Size([8, 1024]) from checkpoint, the shape in current model is torch.Size([9, 1024]).
Hello, I want to know why your file "config.json" only has 8 labels for the CoNLL-2003 dataset. I think it should have 9 labels.

stefan-it · 2020-11-21T13:59:29Z

Hi @lijiayi980130 ,

good question, I think you're referring to our model:

https://huggingface.co/dbmdz/electra-large-discriminator-finetuned-conll03-english

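The reason for 8 labels is that the original dataset is IOB1 labelled (yes, there are some IOB2 labelled datasets on the internet, but these are not the official ones). In IOB1, a B- tag is only used when an entity directly follows another entity of the same type. That never happens for PER in the official files, so B-PER does not occur and only 8 distinct labels remain: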

$ cat eng.t* | cut -d " " -f 4 | grep -v "^$" | sort | uniq
B-LOC
B-MISC
B-ORG
I-LOC
I-MISC
I-ORG
I-PER
O
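
To illustrate where the ninth label comes from, here is a minimal sketch of an IOB1 to IOB2 conversion. The function name `iob1_to_iob2` and the toy sentence are made up for this example and are not part of our training code; the 8-label list is the one printed above.

```python
# The 8 labels found in the official (IOB1) CoNLL-2003 files, as listed above.
IOB1_LABELS = ["B-LOC", "B-MISC", "B-ORG", "I-LOC", "I-MISC", "I-ORG", "I-PER", "O"]

# IOB2 needs a B- and an I- label for every entity type, plus "O".
ENTITY_TYPES = sorted({label.split("-", 1)[1] for label in IOB1_LABELS if label != "O"})
IOB2_LABELS = sorted([f"{prefix}-{entity}" for entity in ENTITY_TYPES for prefix in ("B", "I")] + ["O"])


def iob1_to_iob2(tags):
    """Convert the IOB1 tags of one sentence to IOB2 (hypothetical helper)."""
    converted = []
    prev = "O"
    for tag in tags:
        if tag == "O":
            converted.append(tag)
        else:
            prefix, entity = tag.split("-", 1)
            prev_entity = prev.split("-", 1)[1] if prev != "O" else None
            # In IOB2 every entity starts with B-. An I- tag opens a new
            # entity when the previous token is "O" or has another type.
            if prefix == "I" and prev_entity != entity:
                converted.append("B-" + entity)
            else:
                converted.append(tag)
        prev = tag
    return converted


# Toy sentence (invented for illustration, not taken from the dataset).
toy_iob1 = ["I-PER", "I-PER", "O", "I-LOC", "B-LOC", "O", "I-ORG"]
toy_iob2 = iob1_to_iob2(toy_iob1)

print(len(IOB1_LABELS), len(IOB2_LABELS))
print(toy_iob2)
```

After conversion the label B-PER shows up, which is why IOB2 versions of the dataset have 9 labels, while a classifier trained on the official IOB1 files has 8 outputs.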

I hope this clarifies the label list entries in the configuration file 🤗

lijiayi980130 · 2020-11-23T01:09:31Z

Thank you! I think I understand now.
