Document-Clusterer

A simple document clusterer that applies singular value decomposition (SVD) to a corpus of CNN stories.

cleaning.py: Processes the directory of cnn-stories and writes the processed stories to a JSON file

model.py: Main program, which performs the clustering
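
For readers who want a feel for SVD-based clustering, here is a minimal sketch. It is not the code in model.py: the function names, the raw term-count weighting, the greedy cosine-threshold grouping, and the toy documents are all assumptions made for illustration. It assumes NumPy is installed.

```python
import re
from collections import Counter

import numpy as np


def tokenize(text):
    """Lowercase a document and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_document_matrix(docs):
    """Build a (documents x vocabulary) matrix of raw term counts."""
    counts = [Counter(tokenize(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = np.array([[c[t] for t in vocab] for c in counts], dtype=float)
    return matrix, vocab


def svd_embed(matrix, k):
    """Project each document onto the top-k singular directions."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k] * s[:k]


def cluster_by_cosine(embedded, threshold=0.8):
    """Greedy grouping (an assumed strategy, chosen for brevity):
    a document joins the first cluster whose seed document has
    cosine similarity >= threshold, otherwise it starts a new cluster."""
    norms = np.linalg.norm(embedded, axis=1, keepdims=True)
    unit = embedded / np.where(norms == 0, 1.0, norms)
    seeds, labels = [], []
    for vec in unit:
        for label, seed in enumerate(seeds):
            if float(vec @ seed) >= threshold:
                labels.append(label)
                break
        else:
            seeds.append(vec)
            labels.append(len(seeds) - 1)
    return labels


# Toy stand-ins for the stories (hypothetical, not from the corpus).
docs = [
    "stocks market shares rally",
    "market shares stocks fall",
    "team wins football match",
    "football team loses match",
]
matrix, vocab = term_document_matrix(docs)
labels = cluster_by_cosine(svd_embed(matrix, k=2))
print(labels)
```

With these toy inputs the two finance stories and the two football stories should land in separate groups. On a real corpus, a weighting such as TF-IDF and a standard clustering algorithm would likely replace the raw counts and the greedy grouping used here.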

TODO: Write a blog post explaining the approach