-
-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade document extraction #187
Commits on Apr 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 12bfaa3 - Browse repository at this point
Copy the full SHA 12bfaa3View commit details -
feat(pdf): Add strip margin flag for PDF extraction
Add pdfplumber as main tool for extracting text from a PDF - and add a strip margin flag to enable cropping out text in the margins and removing skewed text
Configuration menu - View commit details
-
Copy full SHA for a91bc95 - Browse repository at this point
Copy the full SHA a91bc95View commit details -
tests(extraction): Add and fix tests
Added and fixed tests Modified one test pdf to better reflect the test
Configuration menu - View commit details
-
Copy full SHA for 0260927 - Browse repository at this point
Copy the full SHA 0260927View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5cb3d1c - Browse repository at this point
Copy the full SHA 5cb3d1cView commit details
Commits on May 14, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8c87680 - Browse repository at this point
Copy the full SHA 8c87680View commit details -
feat(tasks): Update extract from pdf
Change extract from pdf to drop ocr available flag
Configuration menu - View commit details
-
Copy full SHA for 5ee15b5 - Browse repository at this point
Copy the full SHA 5ee15b5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9d52634 - Browse repository at this point
Copy the full SHA 9d52634View commit details -
Configuration menu - View commit details
-
Copy full SHA for 233a615 - Browse repository at this point
Copy the full SHA 233a615View commit details -
Configuration menu - View commit details
-
Copy full SHA for c070cb2 - Browse repository at this point
Copy the full SHA c070cb2View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for e3855f0 - Browse repository at this point
Copy the full SHA e3855f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6dd78f1 - Browse repository at this point
Copy the full SHA 6dd78f1View commit details
Commits on May 15, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6c0fef0 - Browse repository at this point
Copy the full SHA 6c0fef0View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for 1e6a0c1 - Browse repository at this point
Copy the full SHA 1e6a0c1View commit details
Commits on May 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3a7666d - Browse repository at this point
Copy the full SHA 3a7666dView commit details -
Configuration menu - View commit details
-
Copy full SHA for ad55b20 - Browse repository at this point
Copy the full SHA ad55b20View commit details -
Configuration menu - View commit details
-
Copy full SHA for 06d26d0 - Browse repository at this point
Copy the full SHA 06d26d0View commit details