Skip to content

Latest commit

 

History

History
51 lines (42 loc) · 2.65 KB

README.md

File metadata and controls

51 lines (42 loc) · 2.65 KB

Open Paths Analysis repository

This is a cookbook repository of recipes for analyzing Open Paths location data. Mainly SQL but anything else is welcomed.

Open Paths is a project that allows you to collect information about where you have been on your mobile phone. It does not have to use your GPS and therefore it does not ruin your battery, so therefore you can silently collect continuosly your location data. The project allows you to download the data as CSV.

The goal of this project is to come up with a set of reciepes that can be useful when anlyzing this data. Like for example: How do I find the flights done in this time period, or Which area normally you go out on weekends? Where do you live? Where do you work? All these questions can be answered with a bit of analysis.

For the time being these are just a collection of SQL reciepes based on a table structure as follows:

  • openpaths: This the raw data from openpaths. It basically is a collection of location traces, lat/lon and timestamp.
  • openpaths_segments: If you connect every point with the follwing one you create a table of "lines" or "segments" and for each segment you can determine the speed and the distance which can be very useful for lot of analysis.

To process openpaths_segments from openpaths you can use the following SQL:

    INSERT INTO openpaths_segments(the_geom,distance,start_timestamp,end_timestamp,speed,interv_sec,session_id) 
        SELECT * FROM (
        WITH segments as 
            (SELECT 
                the_geom as the_geom_start,
                (SELECT the_geom from openpaths as e WHERE e.cartodb_id=s.cartodb_id+1) as the_geom_end,
                trace_timestamp as start_date, 
                (SELECT trace_timestamp from openpaths as e WHERE e.cartodb_id=s.cartodb_id+1) as end_date
            FROM openpaths as s 
            ORDER BY cartodb_id)
        SELECT 
            ST_Multi(ST_MakeLine(the_geom_start,the_geom_end)) as the_geom, 
            ST_Distance(the_geom_start::geography,the_geom_end::geography) as distance, 
            start_date,
            end_date, 
            CASE WHEN 
                extract(epoch from (end_date-start_date))=0 THEN 0
            ELSE 
                ST_Distance(the_geom_start::geography,the_geom_end::geography)/extract(epoch from (end_date-start_date))  
            END as speed,
            extract(epoch from (end_date-start_date)) as interv_sec
    FROM segments) as lala

Tools being used:

Right now we are only using CartoDB as a way to analyze the data. CartoDB is an Open Source tool based on PostgreSQL and PostGIS.