This repository has been archived by the owner on Mar 5, 2024. It is now read-only.
This release represents a major rewrite of sequins. We've been running this new code internally for more than six months, and we think it's in pretty good shape!
The main difference is that sequins can now run as a cluster, partitioning and replicating data across multiple machines. There are lots of other changes, as well, including:
- A better on-disk format, including on-disk compression
- File-based configuration, with lots of configuration options
- A web interface with status information, and lots of monitoring hooks
- A comprehensive manual
To say the least, this release is highly backwards incompatible with sequins 0.4.x. Moreover, input data that was organized for 0.4.x needs to be organized slightly different in S3 or HDFS. You can read more about the new source root layout here.
Additionally, since v1.0.0rc1, a few issues have been fixed: