Skip to content

v2.0

Compare
Choose a tag to compare
@NastyBoget NastyBoget released this 25 Dec 13:39
· 14 commits to master since this release
1888659
  • Fix table extraction from PDF using empty config (see issue)
  • Add more benchmarks for Tesseract
  • Fix extension extraction for file names with several dots
  • Change names of some methods and their parameters for all main classes (attachments extractors, converters, readers, metadata extractors, structure extractors, structure constructors).
    Please look to the Package reference of documentation for more details
  • Add AttachAnnotation and TableAnnotation to PPTX (see discussion)
  • Fix bugs in DOCX handling (see issues 378, 379