This is the Cascading.Multitool (Multitool) application.
Multitool provides a simple command line interface for building data
processing jobs. Think of this as grep
, sed
, and awk
for
Hadoop, which also supports joins between multiple data-sets.
See the Multitool page for installation and usage.
To build Multitool, you may download the source code from GitHub:
https://github.com/cascading/cascading.multitool
To build multitool with gradle do this:
$ gradle build
See apl.txt
in this directory.