Releases: deanmalmgren/textract
Releases · deanmalmgren/textract
v1.6.4
v1.6.3
fix the msg
parser and update the Travis CI build
v1.6.2
update dependencies and make pocketsphinx
optional
v1.6.1
documentation build fixes
v1.6.0
psv/tsv parsers, user-provided filename extensions, audio parsing with pocketsphinx, and several other bug fixes
v1.5.0
python 3 compatability, improved docx extraction, improved image extraction, and more.
v1.4.0
pdf layout preservation, extensionless file support, and several 🐛 fixes
v1.3.0
Added .rtf and .msg support
v1.2.0
Includes support for tiff files and a new --option/-O command line option to pass in arbitrary keyword arguments to parsers, like the language for tesseract OCR
v1.1.0
support for a variety of formats, including audio (.wav, .mp3, .ogg), csv, scanned pdfs, and htm plus various bug fixes and internal improvements.