Tika is a package for the Datagrok platform. It demonstrates integration with Apache Tika for file metadata extraction using CLI tools.
Here are the files and directories of particular interest:
- TikaExtractor.c : "tika-extractor" CLI tool source code
- tika-extractor.py : Python wrapper for "tika-extractor" CLI tool using Scripting
- bin : directory with Java Tika metadata extractor tool binaries
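
As a rough illustration of what wrapping a CLI tool from Python can look like, here is a minimal sketch. It is not the contents of tika-extractor.py. The command name passed to `subprocess.run`, the single file-path argument, and the `key: value` output format are all assumptions made for the example; the helper names `parse_metadata` and `extract_metadata` are hypothetical.

```python
import subprocess


def parse_metadata(text):
    """Parse 'key: value' lines into a dict.

    The 'key: value' format is an assumption for this sketch,
    not the documented output of the tika-extractor tool.
    """
    metadata = {}
    for line in text.splitlines():
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep and key.strip():
            metadata[key.strip()] = value.strip()
    return metadata


def extract_metadata(file_path, command=("tika-extractor",)):
    """Run the CLI tool on a file and return its parsed output.

    `command` is a hypothetical invocation; adjust it to however
    the binaries in `bin` are actually launched.
    """
    result = subprocess.run(
        [*command, file_path],
        capture_output=True,  # collect stdout/stderr instead of printing
        text=True,            # decode output as str
        check=True,           # raise CalledProcessError on non-zero exit
    )
    return parse_metadata(result.stdout)
```

Keeping the parsing separate from the process call makes the parsing step testable without the Java binaries installed.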
See also: