This repository was archived by the owner on May 14, 2022. It is now read-only.
When ingesting an item, a derivative should be created that builds hOCR of the document via Tesseract and stores it #57
Closed
Description
https://github.com/meh/ruby-tesseract-ocr seems to be the most actively maintained Ruby binding for Tesseract.
Users doing ingest may need to set the OCR language for best results. Should hOCR generation live in Hydra::Derivatives?
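
A rough sketch of what the hOCR step could look like, with the language passed in at ingest time. It shells out to the `tesseract` command-line tool rather than going through ruby-tesseract-ocr, so it makes no claims about that gem's API. The helper names `hocr_command` and `create_hocr` are hypothetical, and whether this belongs in Hydra::Derivatives is still the open question above.

```ruby
require "open3"

# Hypothetical helper: builds the argument list for the tesseract CLI.
# "-l" selects the recognition language and the trailing "hocr" config
# asks tesseract to write hOCR instead of plain text.
def hocr_command(input_path, output_base, language: "eng")
  ["tesseract", input_path, output_base, "-l", language, "hocr"]
end

# Hypothetical helper: runs tesseract and returns the expected output path.
# Assumption: the installed tesseract writes "<output_base>.hocr"; some older
# 3.x releases wrote "<output_base>.html" instead.
def create_hocr(input_path, output_base, language: "eng")
  command = hocr_command(input_path, output_base, language: language)
  _stdout, stderr, status = Open3.capture3(*command)
  raise "tesseract failed: #{stderr}" unless status.success?

  "#{output_base}.hocr"
end
```

For example, `create_hocr("page.tif", "page", language: "deu")` would run OCR with the German language data, assuming `tesseract` and that language pack are installed on the ingest machine. Defaulting to `eng` keeps the call simple when the depositor does not specify a language.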