This data lives at https://github.com/vega/vega-datasets
Common repository for example datasets used by vega related projects. Keep changes to this repository minimal as other projects (vega, vega-editor, vega-lite, polestar, voyager) use this data in their tests and for examples.
The list of sources is in sources.md.
Add this to your package.json:
"vega-datasets": "vega/vega-datasets#gh-pages"
You can also get the data directly via HTTP served by Github like:
https://vega.github.io/vega-datasets/data/cars.json
You can use git subtree to add these datasets to a project. Add data git subtree add
like:
git subtree add --prefix path-to-data [email protected]:vega/vega-datasets.git gh-pages
Update to the latest version of vega-data with
git subtree pull --prefix path-to-data [email protected]:vega/vega-datasets.git gh-pages
- Remove all tabs in
github.csv
to prevent incorrect field name parsing.
- Dates in
movies.json
are all recognized as date types by datalib - Dates in
crimea.json
are now in ISO format (YYYY-MM-DD)
- Fix
cars.json
date format
- Add Gapminder Health v.s. Income dataset
- Add generated Github contributions data for punch card visualization
- Add Anscombe's Quartet dataset
- Change date format in weather data so that it can be parsed in all browsers. Apparently YYYY/MM/DD is fine. Can also omit hours now.
- Decode origins in cars dataset
- Add Unemployment Across Industries in US
- Fixed the date parsing on the CrossFilter datasets -- an older version of the data was copied over on initial import. A script is now available via
npm run flights N
to re-sampleN
records from the originalflights-3m.csv
dataset.
- Add
seattle-weather
dataset. Transformed with https://gist.github.com/domoritz/acb8c13d5dadeb19636c
- Initial import from vega and vega-lite
- Change field names in
cars.json
to be more descriptive (hp
toHorsepower
)