SpaCy annotator for Named Entity Recognition (NER) using ipywidgets. The annotator allows users to quickly assign (custom) labels to one or more entities in the text, including noisy-prelabelling!
Features:
- The annotator supports pandas dataframe: it adds annotations in a separate 'annotation' column of the dataframe;
- Why not use transformers to label your data for you? If a model is passed into the annotator, it is used to identify entities and pre-fill the annotator for you.
- The annotations adhere to spaCy format and are ready to serve as input to a spaCy NER model.
No additional code required!
Blog post: medium/enrico.alemani/spacy-annotator
pip install spacy-annotator
spacy-annotator_demo.mov
For code, see spacy_annotator demo notebook.
dayalstrub-cma - Refactored code to class, added displacy visualisation and entity ruler.
LeafmanZ - Added to_spacy
method.
- Fork the repo on GitHub;
- Clone the project to your own machine;
- Commit changes to your own branch; and
- Push your work back up to your own fork;
- Submit a Pull request so that I can review your changes.
Spacy-annotator works with SpaCy 3.X, and ipywidgets 7.X.
spacy-annotator is based on spaCy and pigeon (see also PigeonXT).
Many thanks to them for making their awesome libraries publicly available. Another interesting project is Doccano.
Note: spaCy is a great library and, most importantly, free to use. So please also consider using the https://prodi.gy/ annotator to keep supporting the spaCy deveopment.
2024-03-25: Update ipywidget requirements to >=8