Skip to content

Latest commit

 

History

History
37 lines (20 loc) · 1.33 KB

File metadata and controls

37 lines (20 loc) · 1.33 KB

Cover

layoutlm-invoices

Description

This is a fine-tuned version of the multi-modal LayoutLM model for the task of question answering on invoices and other documents. It has been fine-tuned on a proprietary dataset of invoices as well as both SQuAD2.0 and DocVQA for general comprehension.

Fine Tune Results

Non-consecutive tokens

Unlike other QA models, which can only extract consecutive tokens (because they predict the start and end of a sequence), this model can predict longer-range, non-consecutive sequences with an additional classifier head. For example, QA models often encounter this failure mode:

Before

Before Results

After

However this model is able to predict non-consecutive tokens and therefore the address correctly.

After Results

License

This model is available on HuggingFace under the CC by NC-SA 4.0 License.

Citation

This model was created by the team at Impira.