Collect and make available metrics data #2

jmatsushita · 2015-08-17T11:47:34Z

What is the minimum viable data structure? What would be the best first step for collecting and publishing metrics data? Versioned JSON files on GitHub? A NoSQL database? Open scrapers on https://morph.io? CKAN?
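To make the "versioned JSON files" option concrete, here is a minimal sketch of what one metrics record could look like. The field names (`schema_version`, `project`, `metric`, `value`, `collected_at`) and the helpers `make_record` and `dump_record` are assumptions for illustration only; nothing in this thread has settled on a schema.

```python
import json
from datetime import datetime, timezone

# Hypothetical schema version; bump it whenever the record layout changes.
SCHEMA_VERSION = 1


def make_record(project, metric, value, collected_at=None):
    """Build one metrics record as a plain dict (illustrative field names)."""
    collected_at = collected_at or datetime.now(timezone.utc)
    return {
        "schema_version": SCHEMA_VERSION,
        "project": project,
        "metric": metric,
        "value": value,
        "collected_at": collected_at.isoformat(),
    }


def dump_record(record):
    """Serialize with sorted keys so that diffs in version control stay small."""
    return json.dumps(record, sort_keys=True, indent=2) + "\n"
```

Sorting the keys and ending the file with a newline keeps successive commits of the same file easy to diff, which is most of the appeal of storing the data in a Git repository.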

jmatsushita · 2015-08-17T12:11:40Z

Also http://dat-data.com/ ?

andrew · 2015-08-19T20:33:13Z

I was pondering this over the weekend. Elasticsearch seems like a good fit for the kind of data modeling planned. It has a very flexible schema, can easily grow to handle more data, offers replication options, and supports some very powerful queries through "Aggregations": https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations.html
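As a rough illustration of what an aggregation could look like for this kind of data, the sketch below builds a search request body that groups documents by project and averages a metric value. The field names `project`, `metric`, and `value` and the helper `build_metric_aggregation` are assumptions, and the `project` field is assumed to be indexed as an exact (non-analyzed) value so that a `terms` aggregation can group on it. The function only builds the JSON body; sending it to a cluster is left out.

```python
import json


def build_metric_aggregation(metric_name):
    """Return a search body that averages `value` per project for one metric.

    All field names here are hypothetical, not an agreed data model.
    """
    return {
        "size": 0,  # return only aggregation results, no individual hits
        "query": {"term": {"metric": metric_name}},
        "aggs": {
            "by_project": {
                "terms": {"field": "project"},
                "aggs": {
                    "average_value": {"avg": {"field": "value"}},
                },
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_metric_aggregation("open_issues"), indent=2))
```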

Also, since v1.7 you can export whole dumps of the data for sharing publicly: https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-snapshots.html

jmatsushita · 2015-08-20T10:17:05Z

Elasticsearch as the main store? It would probably be the smoothest way to get started. Have you seen this about resilience?

I'm quite curious as to how well the MySQL / MongoDB combo is working in practice for @gousiosg with GHTorrent.

Another option is to borrow infrastructure like BigQuery, Redshift, or Cloud Dataflow.

gousiosg · 2015-08-20T10:43:44Z

The MySQL and MongoDB combo is working quite well, although scaling is just starting to become an issue. The real issue is consistency across the two.
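One simple way to see what consistency across two stores means in practice is to compare the record identifiers each store holds. The sketch below does that with plain Python collections standing in for the two databases. It is not how GHTorrent handles the problem; the function name `find_inconsistencies` and the idea of comparing shared ids are assumptions made for illustration.

```python
def find_inconsistencies(relational_ids, document_ids):
    """Report ids that are present in one store but missing from the other.

    Both arguments are iterables of ids, e.g. the results of a key-only
    query against each store (hypothetical; no database access happens here).
    """
    relational_ids = set(relational_ids)
    document_ids = set(document_ids)
    return {
        "missing_in_document_store": sorted(relational_ids - document_ids),
        "missing_in_relational_store": sorted(document_ids - relational_ids),
    }
```

A check like this only detects drift between the stores; avoiding it in the first place would need some form of coordinated writes or a single source of truth.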

jmatsushita added the feature label Aug 17, 2015

jmatsushita mentioned this issue Aug 20, 2015

Measurement agents specs #3

Open

jmatsushita mentioned this issue Aug 20, 2015

Develop data model for open repository of software metrics #1

Open
