Simon Gray edited this page Oct 11, 2017 · 22 revisions

This is the Wiki for corenlp-clj, a wrapper for Stanford CoreNLP written in idiomatic Clojure.

Mission

Stanford CoreNLP has a rather clunky design, with lots of cruft built up over the years, but it is nevertheless a powerful tool for Natural Language Processing. This library seeks to apply a lighter and more functional style to its API design, while still working directly with the data structures found in the Java version. The design is inspired in particular by the philosophy of Rich Hickey's talk Simple Made Easy.

A secondary goal of the project is to provide sensible documentation for newcomers to NLP. Stanford CoreNLP is not a beginner-friendly tool, but corenlp-clj aims to be beginner-friendly while remaining powerful.

Current goal

The development of this library is currently driven by my own need for sensible implementations of CoreNLP functionality related to part-of-speech tags and dependency graphs in Chinese. The base pipeline is ready for any kind of annotation work, and every annotation can be accessed using a set of common functions. At the moment I'm working on implementing specific functionality in the semgraph package.

Links worth checking out
