This repository was archived by the owner on May 14, 2022. It is now read-only.

This repository was archived by the owner on May 14, 2022. It is now read-only.

When ingesting an item a derivative should be created which builds hOCR of the document via Tesseract and stores it #57

Closed

Closed

When ingesting an item a derivative should be created which builds hOCR of the document via Tesseract and stores it#57

Assignees

opened

https://github.com/meh/ruby-tesseract-ocr seems to be the most active.

Users doing ingest may need to set language for best results. Should this be in Hydra::Derivatives?

Metadata

Assignees

tpendragon

Labels

No labels

No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests