This repo contains training code for an algorithm to indicate whether a specific predication identified by SemRep is in fact contained within its source sentence, as well as source materials from the University.
To load the trained model and run it, use the evaluation code in the second half of the main notebook.
main_notebook.ipynb
: The main content of this repo; contains a narrative of the development and the actual training and evaluation code
data_prep.ipynb
: A short notebook containing some data exploration and preprocessing
requirements.txt
: A list of the packages used in this project
Task_Description.md
: The original task description
README.md
: This file
data/
: Contains the original csv of the data as well as a pickle file of the data after preprocessing
models/
: Contains the trained model (uploaded with Git Large File Storage)
extras/
: Holding space for working-idea scratch notebooks and a plot included in the main notebook
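
For readers unfamiliar with the csv and pickle pairing in data/, the following is a minimal sketch of that pattern using pandas. It is not the code from data_prep.ipynb: the function name `csv_to_pickle`, the file names, the column names, and the drop-missing-rows step are all assumptions made for illustration, and the example runs on toy rows in a temporary directory rather than on the repo's data.

```python
import tempfile
from pathlib import Path

import pandas as pd


def csv_to_pickle(csv_path: Path, pickle_path: Path) -> pd.DataFrame:
    """Read a CSV, apply a placeholder cleaning step, and cache the result as a pickle."""
    df = pd.read_csv(csv_path)
    # Placeholder preprocessing (an assumption): drop rows with missing values.
    df = df.dropna().reset_index(drop=True)
    # Store the cleaned frame so later steps can skip the preprocessing.
    df.to_pickle(pickle_path)
    return df


if __name__ == "__main__":
    # Toy rows with hypothetical column names, not the repo's real schema.
    toy = pd.DataFrame(
        {
            "sentence": ["Aspirin treats headache.", None],
            "predication": ["aspirin TREATS headache", "x CAUSES y"],
            "label": [1, 0],
        }
    )
    with tempfile.TemporaryDirectory() as tmp:
        csv_path = Path(tmp) / "example.csv"    # hypothetical file name
        pickle_path = Path(tmp) / "example.pkl"  # hypothetical file name
        toy.to_csv(csv_path, index=False)
        cleaned = csv_to_pickle(csv_path, pickle_path)
        reloaded = pd.read_pickle(pickle_path)
        # Report whether the pickled frame matches the cleaned frame.
        print(reloaded.equals(cleaned))
```

Caching the preprocessed frame as a pickle preserves column dtypes and avoids re-running the cleaning step each time a notebook starts, at the cost of a file that should only be loaded from a trusted source.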