Here we have used the concept of BOW and Term frequency and Inverse document frequency to process the questions and use Logistic regression, Random forest and Naive bayes to train the data and got a F1 score of around 93. Below libraries needs to be installed on computer
- Pandas
- Numpy
- Sklearn
- Nltk This is a single file document, and this file don’t require any dependent files expect the above installed file.
- bag of words
- term frequency and inverse document frequency
- vectorization
- Naive Bayes
- Logistic Regression
- RandomForest
1. Systems and Approaches for Question Answering: ailao.eu/yodaqa/odbstud.pdf
2. Learning Question Classifiers: www.aclweb.org/anthology/C02-1150