Skip to content

Pilosa is an open source, distributed bitmap index that dramatically accelerates queries across multiple, massive data sets.

License

Apache-2.0, Unknown licenses found

Licenses found

Apache-2.0
LICENSE
Unknown
license.exceptions
Notifications You must be signed in to change notification settings

tgruben/pilosa

This branch is 27 commits behind FeatureBaseDB/featurebase:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ad3f2d8 · Mar 28, 2023
Mar 17, 2023
Mar 27, 2023
Nov 15, 2022
Dec 12, 2022
Sep 29, 2022
Mar 24, 2023
Feb 24, 2023
Mar 14, 2023
Mar 22, 2023
Feb 6, 2023
Mar 22, 2023
Dec 12, 2022
Mar 22, 2023
Mar 28, 2023
Mar 12, 2023
Feb 17, 2023
Mar 24, 2023
Mar 20, 2023
Mar 14, 2023
Jan 11, 2023
Sep 6, 2022
Sep 2, 2022
Sep 6, 2022
Sep 29, 2022
Mar 27, 2023
Nov 15, 2022
Jan 23, 2023
Mar 3, 2023
Feb 3, 2023
Sep 2, 2022
Sep 6, 2022
Sep 2, 2022
Nov 15, 2022
Mar 24, 2023
Mar 24, 2023
Jan 23, 2023
Jan 23, 2023
Dec 12, 2022
Dec 12, 2022
Mar 12, 2023
Mar 14, 2023
Nov 8, 2022
Sep 2, 2022
Mar 24, 2023
Oct 25, 2022
Nov 15, 2022
Mar 14, 2023
Mar 28, 2023
Nov 15, 2022
Jan 19, 2023
Sep 2, 2022
Jan 25, 2023
Sep 2, 2022
Feb 17, 2023
Mar 14, 2023
Sep 30, 2022
Mar 14, 2023
Sep 2, 2022
Sep 6, 2022
Nov 15, 2022
Mar 14, 2023
Mar 15, 2023
Mar 20, 2023
Mar 14, 2023
Sep 2, 2022
Dec 12, 2022
Jan 19, 2023
Jan 19, 2023
Dec 12, 2022
Feb 1, 2023
Dec 12, 2022
Mar 7, 2023
Jan 13, 2023
Sep 2, 2022
Mar 8, 2023
Sep 2, 2022
Feb 14, 2023
Mar 24, 2023
Mar 24, 2023
Dec 12, 2022
Jan 19, 2023
Mar 10, 2023
Nov 15, 2022
Jan 23, 2023
Mar 3, 2023
Jan 10, 2023
Sep 6, 2022
Sep 6, 2022
Jan 12, 2023
Dec 12, 2022
Dec 12, 2022
Dec 12, 2022
Jan 20, 2023
Mar 12, 2023
Mar 24, 2023
Jan 25, 2023
Nov 15, 2022
Sep 2, 2022
Sep 2, 2022
Jan 19, 2023
Nov 15, 2022
Oct 28, 2022
Sep 30, 2022
Mar 24, 2023
Sep 6, 2022
Feb 3, 2023
Sep 2, 2022
Dec 12, 2022
Sep 6, 2022
Mar 24, 2023
Dec 12, 2022
Mar 24, 2023
Mar 24, 2023
Jan 10, 2023
Oct 28, 2022
Sep 2, 2022
Mar 24, 2023
Mar 24, 2023
Sep 2, 2022
Sep 2, 2022
Mar 20, 2023
Mar 20, 2023
Jan 23, 2023
Dec 12, 2022
Dec 12, 2022
Feb 17, 2023
Oct 28, 2022
Nov 17, 2022
Mar 12, 2023
Sep 29, 2022
Mar 24, 2023
Mar 14, 2023
Oct 28, 2022
Mar 10, 2023
Sep 6, 2022
Jan 19, 2023
Mar 24, 2023
Oct 28, 2022
Mar 6, 2023
Feb 16, 2023
Mar 24, 2023
Jan 23, 2023
Sep 2, 2022
Sep 2, 2022
Sep 2, 2022
Sep 2, 2022
Sep 30, 2022
Mar 14, 2023
Feb 21, 2023
Mar 24, 2023
Jan 19, 2023
Feb 22, 2023
Sep 6, 2022
Feb 22, 2023
Sep 6, 2022
Mar 24, 2023
Jan 19, 2023
Jan 12, 2023
Mar 24, 2023
Sep 2, 2022
Mar 24, 2023
Nov 15, 2022
Mar 24, 2023
Dec 12, 2022
Mar 24, 2023
Dec 12, 2022
Nov 15, 2022
Sep 2, 2022
Sep 2, 2022
Sep 2, 2022
Sep 6, 2022
Dec 12, 2022
Feb 22, 2023
Mar 3, 2023
Jan 10, 2023
Jan 12, 2023
Jan 12, 2023
Mar 24, 2023
Sep 30, 2022
Nov 17, 2022
Dec 12, 2022
Sep 2, 2022
Sep 30, 2022
Sep 2, 2022
Sep 30, 2022
Feb 17, 2023
Sep 2, 2022
Mar 24, 2023
Sep 30, 2022
Mar 7, 2023

FeatureBase

Pilosa is now FeatureBase

As of September 7, 2022, the Pilosa project is now FeatureBase. The core of the project remains the same: FeatureBase is the first real-time distributed database built entirely on bitmaps. (More information about updated capabilities and improvements below.)

FeatureBase delivers low-latency query results, regardless of throughput or query volumes, on fresh data with extreme efficiency. It works because bitmaps are faster, simpler, and far more I/O efficient than traditional column-oriented data formats. With FeatureBase, you can ingest data from batch data sources (e.g. S3, CSV, Snowflake, BigQuery, etc.) and/or streaming data sources (e.g. Kafka/Confluent, Kinesis, Pulsar).

For more information about FeatureBase, please visit www.featurebase.com.

Getting Started

Build FeatureBase Server from source

  1. Install go. Ensure that your shell's search path includes the go/bin directory.
  2. Clone the FeatureBase repository (or download as zip).
  3. In the featurebase directory, run make install to compile the FeatureBase server binary. By default, it will be installed in the go/bin directory.
  4. In the idk directory, run make install to compile the ingester binaries. By default, they will be installed in the go/bin directory.
  5. Run featurebase server --handler.allowed-origins=http://localhost:3000 to run FeatureBase server with default settings (learn more about configuring FeatureBase at the link below). The --handler.allowed-origins parameter allows the standalone web UI to talk to the server; this can be omitted if the web UI is not needed.
  6. Run curl localhost:10101/status to verify the server is running and accessible.

Ingest Data and Query

  1. Run
molecula-consumer-csv \
    --index repository \
    --header "language__ID_F,project_id__ID_F" \
    --id-field project_id \
    --batch-size 1000 \
    --files example.csv

This will ingest the example.csv file into a FeatureBase table called repository. If the table does not exist, it will be automatically created. Learn more about ingesting data into FeatureBase

  1. Query your data.
curl localhost:10101/index/repository/query \
     -X POST \
     -d 'Row(example=5)'

Learn about supported SQL, native Pilosa Query Language (PQL).

Data Model

Because FeatureBase is built on bitmaps, there is bit of a learning curve to grasp how your data is represented. Learn about Data Modeling.

More Information

Installation

Configuration

Community

You can email us at [email protected] or learn more about contributing at https://www.featurebase.com/community.

Chat with us: https://discord.gg/FBn2vEp7Na

What's Changed Since the Pilosa Days?

A lot has changed since the days of Pilosa. This list highlights some new capabilites included in FeatureBase. We have also made signficant improvements to the performance, scalability, and stability of the FeatureBase product.

  • Query Languages: FeatureBase supports Pilosa Query Language (PQL), as well as SQL
  • Stream and Batch Ingest: Combine real-time data streams with batch historical data and act on it within milliseconds.
  • Mutable: Perform inserts, updates, and deletes at scale, in real time and on-the-fly. This is key for meeting data compliance requirements, and for reflecting the constantly-changing nature of high-volume data.
  • Multi-Valued Set Fields: Store multiple comma-delimited values within a single field while increasing query performance of counts, TopKs, etc.
  • Time Quantums: Setting a time quantum on a field creates extra views which allow ranged Row queries down to the time interval specified. For example, if the time quantum is set to YMD, ranged Row queries down to the granularity of a day are supported.
  • RBF storage backend: this is a new compressed bitmap format which improves performance in a number of ways: ACID support on a per shard basis, prevents issues with the number of open files, reduces memory allocation and lock contention for reads, provides more consistent garbage collection, and allows backups to run concurrently with writes. However, because of this change, Pilosa backup files cannot be restored into FeatureBase.

License

FeatureBase is licensed under the Apache License, Version 2.0

About

Pilosa is an open source, distributed bitmap index that dramatically accelerates queries across multiple, massive data sets.

Resources

License

Apache-2.0, Unknown licenses found

Licenses found

Apache-2.0
LICENSE
Unknown
license.exceptions

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Go 95.8%
  • TypeScript 2.1%
  • JavaScript 1.1%
  • Shell 0.3%
  • Makefile 0.3%
  • SCSS 0.2%
  • Other 0.2%