tmdm

Text mining data model with integration of various formats, annotation libraries, bells and whistles.

Setup

Using pip

pip install git+https://github.com/schlevik/tmdm

From source

git clone https://github.com/schlevik/tmdm
cd tmdm
pip install -r requirements.txt
pip install --editable .
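
After either installation route, it can be useful to confirm that the package is visible to the interpreter before running the example. The following is a minimal sketch using only the standard library; the helper name `is_installed` is hypothetical and not part of tmdm, and the check only verifies that a top-level package can be located, not that its optional dependencies or models are available.

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if the top-level `package` can be located on the import path."""
    # find_spec returns None when no top-level module of that name is found.
    return importlib.util.find_spec(package) is not None


if __name__ == "__main__":
    # "tmdm" is the top-level package name imported in the example below.
    status = "found" if is_installed("tmdm") else "not found"
    print(f"tmdm: {status}")
```

If the package is reported as not found, the usual cause is that pip installed it into a different environment than the one running the script.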

Example

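from tmdm import tmdm_pipeline, add_coref
from tmdm.allennlp.coref import get_coref_provider

# Build the pipeline and attach a coreference provider backed by an AllenNLP model.
nlp = tmdm_pipeline()
add_coref(nlp, provider=get_coref_provider("https://storage.googleapis.com/allennlp-public-models/coref-spanbert-large-2020.02.27.tar.gz"))

doc = nlp("I like cakes. They taste nice.")
# Two coreference annotations are expected for this text.
assert len(doc._.corefs) == 2
# With standard tokenization, doc[2:3] covers "cakes" and doc[4] is "They"; the two should be coreferent.
assert doc[2:3]._.get_coref().coreferent(doc[4])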

Proper documentation is in preparation.
