Skip to content

Releases: deanmalmgren/textract

v1.6.4

21 Aug 17:09
Compare
Choose a tag to compare

Several updates. See changelog for details

v1.6.3

31 Jul 13:10
e500168
Compare
Choose a tag to compare

fix the msg parser and update the Travis CI build

v1.6.2

16 Jul 14:34
cee7546
Compare
Choose a tag to compare

update dependencies and make pocketsphinx optional

v1.6.1

17 Jun 13:10
Compare
Choose a tag to compare

documentation build fixes

v1.6.0

03 Apr 08:31
Compare
Choose a tag to compare

psv/tsv parsers, user-provided filename extensions, audio parsing with pocketsphinx, and several other bug fixes

v1.5.0

15 Nov 21:53
Compare
Choose a tag to compare

python 3 compatability, improved docx extraction, improved image extraction, and more.

v1.4.0

10 Oct 13:19
Compare
Choose a tag to compare

pdf layout preservation, extensionless file support, and several 🐛 fixes

v1.3.0

23 Jun 10:17
Compare
Choose a tag to compare

Added .rtf and .msg support

v1.2.0

31 Jan 08:57
Compare
Choose a tag to compare

Includes support for tiff files and a new --option/-O command line option to pass in arbitrary keyword arguments to parsers, like the language for tesseract OCR

v1.1.0

03 Oct 11:20
Compare
Choose a tag to compare

support for a variety of formats, including audio (.wav, .mp3, .ogg), csv, scanned pdfs, and htm plus various bug fixes and internal improvements.