NLP is a set of tools used to derive meaningful and useful information from natural language sources such as web pages and text documents. Apache OpenNLP is an open-source Java library which is used to process natural language text. You can build an efficient text processing service using this library.
- Download Apache OpenNLP distribution
- Extract the zip file
- Add
path-to/apache-opennlp-1.9.3/bin
in your Environment Variables
- Open this repo in any of your favorite IDE, i am using IntelliJ IDEA :)
- Add
opennlp-tools-1.9.3.jar
andopennlp-uima-1.9.3.jar
in your project
Download the pre-trained models provided by Apache OpenNLP as follows
- Open
OpenNLPModelsDownloader.java
file in terminal and execute command with space separated all the models taken from Apache OpenNLP website. You can download as many models as you want.
java OpenNLPModelsDownloader.java da-sent.bin en-ner-person.bin
Run SentenceDetection
or NamedEntityDetection
to test these model.