Skip to content
danieldk edited this page Oct 30, 2010 · 3 revisions

Introduction

Citar is a part-of-speech tagger. A part of speech tagger assigns morphosyntacical tags to words. For instance, to the sentence "The cat is on the mat ." we could assign the tags AT NN BEZ IN AT NN . Morphosyntactic information is used for further analysis of a sentence.

For technically inclined people: Citar is based on a trigram Hidden Markov Model with linear interpolation smoothing. Practically, this means that Citar is both fast and accurate.

Warning: The Citar API will be highly unstable for the first few versions!

What's next?

Clone this wiki locally