Initial project skeleton invoice2data #11422
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
invoice2data is a powerful and easy-to-use Python library that saves users time and effort when it comes to extracting data from PDF invoices. By creating a template file that specifies the location of the fields that should be extracted, invoice2data pulls out structured data from invoices and converts it into a format that can be easily analyzed and processed. Invoice2data provides the power of adding custom plugins. This greatly increases the attack surface as plug-ins do not have a central update mechanism and the added functionality becomes an extra target to attackers. In order to minimize risk with this added functionality, it is important that the vulnerabilities in the code-base itself do not go uncovered. There could be detrimental effects as the risk is compounded by not only vulnerabilities that may come with the plug-ins but also the vulnerabilities in the invoice2data code-base itself. Invoice2data has a large user base with 1.6k stars on Github, 450 forked projects off of it and nearly 50 contributors. As evidenced by the many articles and videos online regarding invoice2data, it is a very popular project that, if fuzzed with OSS-Fuzz, would contribute to a more secure open source project ecosystem.
Approval from upstream can be found here: invoice-x/invoice2data#543