A command line tool for reading and writing data to/from multiple locations and across multiple formats.
This project is under active development and many features are considered alpha.
Please do play around with this project and provide early feedback, but expect things to change until we hit the 1.0 release.
Documentation can be found at: https://docs.clusterless.io/tessellate/1.0-wip/index.html
All tessellate releases are available via Homebrew:
brew tap clusterlesshq/tap
brew install tessellate
tess --version
Available on Docker Hub:
You can also download the latest releases directly from GitHub:
A primary activity of any data-engineering effort is to format and organize data for different access patterns.
For example, logs frequently arrive as lines of text, but are often best consumed as structured data. Different stakeholders also have different needs, so the same log data must be organized in multiple ways to support them.
Tessellate was designed to support data engineers and data scientists in their efforts to manage data.
Tessellate may be used from the command line, but also natively supports the Clusterless workload model.
To retrieve the Cascading WIP releases, add the following to gradle.properties:
githubUsername=[your github username]
githubPassword=[your github personal access token]
See the GitHub documentation on creating a personal access token.
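To keep the token out of the project checkout, the two properties can be written to a Gradle properties file outside the repository. The following is a minimal sketch, not part of the Tessellate build: the function name write_github_props and the environment variable names GITHUB_USERNAME and GITHUB_TOKEN are assumptions made for illustration.

```shell
# Hypothetical helper: append the GitHub credentials to a Gradle properties file.
# GITHUB_USERNAME and GITHUB_TOKEN are assumed environment variable names.
write_github_props() {
  props_file="$1"

  # Refuse to write empty credentials.
  if [ -z "${GITHUB_USERNAME:-}" ] || [ -z "${GITHUB_TOKEN:-}" ]; then
    echo "GITHUB_USERNAME and GITHUB_TOKEN must be set" >&2
    return 1
  fi

  mkdir -p "$(dirname "$props_file")"
  touch "$props_file"

  # Skip if the username property is already present, so repeated runs
  # do not append duplicate lines.
  if grep -q '^githubUsername=' "$props_file"; then
    return 0
  fi

  printf 'githubUsername=%s\ngithubPassword=%s\n' \
    "$GITHUB_USERNAME" "$GITHUB_TOKEN" >> "$props_file"
}
```

Calling write_github_props "$HOME/.gradle/gradle.properties" targets the per-user Gradle properties file, which Gradle reads in addition to the gradle.properties in the project root. Whether to use the per-user file or the project file is a choice left to the reader.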
./gradlew installDist
./tessellate-main/build/install/tessellate/bin/tess --help
Documentation coming soon, but see the tests for usage.