Skip to content

Latest commit

 

History

History
26 lines (21 loc) · 834 Bytes

README.md

File metadata and controls

26 lines (21 loc) · 834 Bytes

Phrase Sense Disambiguation for NLP using PhraseBase

A Hadoop MapReduce java implementation for disambiguating entities from simple English sentences using a phrasal knowledge base.

Requirements

Steps

  • Clone the repository

  • Run in terminal hadoop com.sun.tools.javac.Main PhraseBase.java

  • Run jar cf PhraseBase.jar PhraseBase*.class

  • Create input.txt and write any simple sentence that has

    • subject
    • verb phrase
    • object

    eg. Sachin Tendulkar was born in Mumbai.

    You can provide multiple sentences in the input file but be sure to write each sentence on a different line and end them with a period (.)

  • Run hadoop jar PhraseBase.jar PhraseBase input.txt