Skip to content

ieriii/spacy-annotator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spacy-annotator

SpaCy annotator for Named Entity Recognition (NER) using ipywidgets. The annotator allows users to quickly assign (custom) labels to one or more entities in the text, including noisy-prelabelling!

Features:

  • The annotator supports pandas dataframe: it adds annotations in a separate 'annotation' column of the dataframe;
  • Why not use transformers to label your data for you? If a model is passed into the annotator, it is used to identify entities and pre-fill the annotator for you.
  • The annotations adhere to spaCy format and are ready to serve as input to a spaCy NER model.
    No additional code required!

Blog post: medium/enrico.alemani/spacy-annotator

Installation

pip install spacy-annotator

Example: annotations using spaCy model

spacy-annotator_demo.mov

For code, see spacy_annotator demo notebook.

Contributors

dayalstrub-cma - Refactored code to class, added displacy visualisation and entity ruler.
LeafmanZ - Added to_spacy method.

Contributing

  1. Fork the repo on GitHub;
  2. Clone the project to your own machine;
  3. Commit changes to your own branch; and
  4. Push your work back up to your own fork;
  5. Submit a Pull request so that I can review your changes.

Dependencies

Spacy-annotator works with SpaCy 3.X, and ipywidgets 7.X.

References

spacy-annotator is based on spaCy and pigeon (see also PigeonXT).
Many thanks to them for making their awesome libraries publicly available. Another interesting project is Doccano.

Note: spaCy is a great library and, most importantly, free to use. So please also consider using the https://prodi.gy/ annotator to keep supporting the spaCy deveopment.

Changelog

2024-03-25: Update ipywidget requirements to >=8