- Updated, more accurate NLP components and annotations
- Automatic date/time tagging by @nitinvwaran
- Enhanced dependencies in syntax files
- Include discourse dependencies and entities in .conllu files
- Give documents meaningful names
- Nest file-type directories inside genre directories
- Use amalgum/ for the balanced data only; move excess data to a separate folder
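
For readers who want to consume the enhanced dependencies and the extra annotations in the .conllu files, the following is a minimal sketch of reading one token line. It relies only on the standard CoNLL-U layout (ten tab-separated columns, with enhanced dependencies in the DEPS column and free-form `key=value` pairs in MISC). The function name `parse_token_line`, the sample line, and the `Entity` key shown in it are illustrative assumptions, not taken from the corpus files themselves.

```python
# Column names defined by the CoNLL-U format, in order.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def parse_token_line(line):
    """Split one tab-separated CoNLL-U token line into a dict (hypothetical helper)."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(CONLLU_FIELDS):
        raise ValueError(f"expected 10 columns, got {len(values)}")
    token = dict(zip(CONLLU_FIELDS, values))

    # DEPS: enhanced dependencies as head:relation pairs joined by "|".
    # "_" marks an empty column. Split on the first ":" only, because
    # relation labels may themselves contain colons (e.g. "nmod:of").
    deps = []
    if token["deps"] != "_":
        for pair in token["deps"].split("|"):
            head, _, relation = pair.partition(":")
            deps.append((head, relation))
    token["deps"] = deps

    # MISC: "|"-separated key=value pairs; "_" marks an empty column.
    misc = {}
    if token["misc"] != "_":
        for item in token["misc"].split("|"):
            key, _, value = item.partition("=")
            misc[key] = value
    token["misc"] = misc
    return token


# Made-up example line; the "Entity" key is an assumption for illustration.
sample = "2\tdogs\tdog\tNOUN\tNNS\tNumber=Plur\t3\tnsubj\t3:nsubj|5:nsubj\tEntity=(animal-1)"
token = parse_token_line(sample)
print(token["deps"])
print(token["misc"])
```

Comment lines (starting with `#`) and blank sentence separators are not handled here; a fuller reader would skip or collect them before calling the helper.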