This repository has been archived by the owner on Mar 25, 2024. It is now read-only.
AMS labeled dataset, 08.2018
Eliminated memory leaks related to libxml use. This release was used to generate the AMS paragraph dataset derived from the arXMLiv 08.2018 HTML5 corpus.