Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Asking for the script for CoNLL2003 data #3

Open
pinesnow72 opened this issue Apr 20, 2021 · 1 comment
Open

Asking for the script for CoNLL2003 data #3

pinesnow72 opened this issue Apr 20, 2021 · 1 comment

Comments

@pinesnow72
Copy link

pinesnow72 commented Apr 20, 2021

Thank you for sharing your code and I am very interested in dice loss especially for NER task.
Here you are sharing CoNLL2003 data of MRC format but the script for NER (with hyper-parameters) is for OntoNotes5 (English). Can you share the script (actually, the hyper-parameters) suitable for CoNLL2003 data?
When I used the script for OntoNotes5 with CoNLL2003 data, I could get about 92.08 F1 (with 10 epochs) but this is a bit lower performance than 93.33 F1, which is reported in the ACL2020 paper. On the contrary, I could get 92.35 F1 with BCE loss and 5 epochs.

And can you share OntoNotes5 data of MRC format or at least query sentences?

@xiaoya-li
Copy link
Contributor

Thanks for asking.

For CoNLL-2003, please download the MRC-style dataset from here and run the script for reproducing experiment results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants