Training scispacy pipelines require recreating the vocab file #440
Comments
Can you try adding a |
Thanks for getting back to me. I tried that, and it seems to have got past that issue, but it has led to this:
|
Ok, I think you are working from an outdated example, because the |
Edit: it looks like the base spaCy models don't have this issue, so it is something more specific. I think it might still be a question for the spaCy folks, but first you should try using the config system and CLI.
If it turns out you do just need that vocab file to continue, you can probably recreate it from the |
See #450 for a workaround.
I'm attempting to use your "en_core_sci_lg" pipeline to extract chemical entities from documents, and then use those entities as a basis to train spaCy's Entity Linker (as shown in this document). Here are the relevant portions of my code:
When I get to the error line (commented towards the end of the code block), I get the following error:
I'm running on macOS 12.4, M1 Pro, 16 GB unified memory, with scispacy==0.5.0 and spacy==3.2.4. Are scispacy models compatible with this workflow, or is that something that hasn't/won't be implemented? Thanks in advance!
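For reference, the entity-extraction half of the workflow described above can be sketched roughly as follows. This is not the code from the original report: `collect_entity_mentions` is a hypothetical helper name, and the sketch assumes only that the pipeline object exposes spaCy's `nlp.pipe`, `doc.ents`, and `ent.text`. The resulting counts are the kind of frequency information one might later feed into an entity-linker knowledge base; that step is not shown here.

```python
from collections import Counter
from typing import Iterable


def collect_entity_mentions(nlp, texts: Iterable[str]) -> Counter:
    """Count how often each entity surface form appears across the texts.

    `nlp` is assumed to be a loaded spaCy-style pipeline (hypothetical
    helper; any object with a compatible `pipe` method works).
    """
    counts: Counter = Counter()
    # nlp.pipe streams the texts through the pipeline and yields Doc objects.
    for doc in nlp.pipe(texts):
        for ent in doc.ents:
            counts[ent.text] += 1
    return counts


# Usage, assuming scispacy's en_core_sci_lg model is installed:
# import spacy
# nlp = spacy.load("en_core_sci_lg")
# freqs = collect_entity_mentions(nlp, ["Aspirin inhibits cyclooxygenase."])
```

Keeping this step separate from the entity-linker training makes it easier to tell whether a failure comes from the scispacy pipeline itself or from the training setup that follows.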