All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
0.1.4 - 2023-12-12
- mm-update: adapt to OCR-D API changes
0.1.3 - 2023-02-11
- mm2tei CLI param controlling page and line refs via @corresp
- mm-update CLI
0.1.2 - 2022-01-10
- tests for TEI API
- tests for insertion index identification
- more logging
- CLI param for output file
- CLI param for image fileGrp
- Add front, body and back by default
- Log to stderr instead of stdout
- Differentiate between (physical) image nr and (logical) page nr
- Evaluate texts from all struct types but binding and colour_checker, #43
- Handle errors during language code expansion, and fall back to Unbekannt, #47
- Add ALTO HYP text content if available, #52
- Allow empty logical structMap and structLink, fall back to physical, or empty, #57
- Allow partial dmdSec (MODS) or amdSec, fall back to empty, #46, #51
- Pass all mods:identifiers to msIdentifier/idno (not just VD and URN)
- Parse full titleInfo (main/sub/part/volume), and re-use in biblFull
- Prefer titleInfo/title over div/@LABEL if available
- Map top logical div/@TYPE into allowed biblFull/title/@level only
- Map top logical div/@TYPE into appropriate bibl/@type if possible
0.1.1 - 2020-05-11
- Make full text file group selectable by user
- Add poor man's namespace versioning handling
- Make extraction of subtitles conditional on their presence
- Use "licence" for all types of licences (even unknown ones), #39
- Handle nested @ADMID="AMD" divs in logical structMap (i.e. newspaper case), #43
- Allow for local path entries (in addition to URLs) in METS, #41
- Add special treatment for URNs and VD IDs, #37
0.1.0 - 2019-12-04
- Correctly place structures which are not on top of a page
- Set corresp and facs attributes of pb elements
- Store links to DEFAULT images in METS
- Tests for new functionality
- Add Changelog file, #28
- Retrieve ALTO files via a dedicated struct link member of the class Mets
- Move text retrieval to Alto class
- Get rid of code artifacts carried over from tocrify