Request to process only new PDFs #17

cutright · 2020-01-07T22:16:21Z

Feature request to ignore previously processed PDFs

cutright · 2020-01-07T23:46:27Z

main.process_files() in branch issue_17 now ignores previously processed files. Collecting the list of processed files is pretty fast, so the bottleneck seems to be iterating through the OS directory rather than parsing the data. The other possibility is that the time is spent checking whether each file name exists in the list of previously processed files.

Needs investigation.
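
One way to tell the two suspects apart is to time the directory scan and the membership checks separately. The sketch below is only an illustration, not the code in branch issue_17: `find_new_pdfs` and `time_steps` are hypothetical names, and it assumes processed files are tracked by base file name. It uses `os.scandir` for the directory pass and a `set` for lookups, since checking membership in a list is linear in the list length while a set lookup is constant time on average.

```python
import os
import time


def find_new_pdfs(directory, processed_names):
    """Return sorted paths of PDFs in `directory` whose base names are not in `processed_names`.

    Hypothetical helper: assumes processed files are identified by base name.
    """
    processed = set(processed_names)  # average O(1) membership checks
    new_files = []
    with os.scandir(directory) as entries:
        for entry in entries:
            if not entry.is_file():
                continue  # skip subdirectories and other non-files
            if not entry.name.lower().endswith(".pdf"):
                continue  # only PDFs are of interest
            if entry.name not in processed:
                new_files.append(entry.path)
    return sorted(new_files)


def time_steps(directory, processed_names):
    """Time the directory scan and the membership checks separately.

    Counts every regular file in the directory, not just PDFs.
    """
    start = time.perf_counter()
    with os.scandir(directory) as entries:
        names = [e.name for e in entries if e.is_file()]
    scan_seconds = time.perf_counter() - start

    processed = set(processed_names)
    start = time.perf_counter()
    new_names = [n for n in names if n not in processed]
    lookup_seconds = time.perf_counter() - start

    return {
        "scan_seconds": scan_seconds,
        "lookup_seconds": lookup_seconds,
        "new_count": len(new_names),
    }
```

If `scan_seconds` dominates, the directory iteration is the cost; if `lookup_seconds` dominates with a list but not with a set, the membership check was the cost.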

cutright added the enhancement label Jan 7, 2020

cutright added a commit that referenced this issue Jan 7, 2020

initial commit for issue #17

90416bf

cutright added this to the v0.3.1 milestone Jan 10, 2020
