You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My initial exercise of taking the Context questions and translating them into a YAML spec used out of date questions.
The final Context v1 spec should be similar in structure (prefer nested properties, short property names, mindful datatypes) but up to date with the latest methodology requirements.
Example from initial exercise, for inspiration
name: 'E-Scooter Routes Traveled Interactive Map data'history:
original_purpose: 'E-Scooter pilot program analysis'other_purposes:
known: []potential:
- 'Commute frequency'
- 'Urban planning'funding:
funded_by: 'PBOT'dependencies: []experiments: []comments: ''composition:
instances:
- type: 'Ride aggregations by geospatial hexagon'count: 16000fields:
- longitude
- latitude
- segment name
- tripsrelationships: []self_contained: trueexternal_guarantees: falsedata_splits: ''evaluation_measures: ''comments: ''collection:
procedure: > Trip data was collected by various E-Scooter providers, including Bird and Lyft. Source data was collected at the ride level, including statistics such as start time, end time, max speed, average speed, and route taken.participants:
- Lyft
- Bird
- Lime
- Skip
- PBOTcompensation_structure: > Lyft, Bird, and Lime collect data for internal usage and analytics. Their incentive for sharing the data with PBOT is to allow the city to analyze ride history during the pilot to determine the outcomes of the pilot. Ultimately, to continue providing their services in the city of Portland.timeframe:
start: 2018/07/23end: 2018/11/20acquisition:
method: 'From E-Scooter providers to PBOT'frequency: 'weekly'validation: > Before publicly releasing the data, all trips under one minute in duration was dropped. All trips representing an outlier measured by avg speed were dropped.completeness:
sample: falsemissing_information: falsecomments: ''preprocessing:
methodology: > Trips from the entire pilot period were aggregated based on geospatial hexagons that span the city limits of Portland. Each hexagon has an associated segment name. Most segments include multiple hexagon. Some hexagons represent multiple segments, (e.g., road intersections).source_data_preserved: truesoftware_used:
- name: ArcGISthird_party: trueopen_source: falselink: ''aligned_with_motivations: truelimitations: > In this aggregated view, only geospatial frequency can be examined and validated. The original analysis includes route durations, trip frequency by time and day, trip frequency by E-Scooter provider, and many other details.comments: ''distribution:
channels:
- PBOT website as a downloadable CSVdoi: falseredundant_archival: truefirst_distribution_date: ''license: nullrestrictions:
access: nullredistribution: nullcomments: ''maintenance:
maintainers:
- 'PBOT'updates: nullerratum: nullobsolute_notification_procedure: nulldata_usage_tracking: nullextensions: nullcomments: ''legal:
collection_disclosure:
disclosure: trueprocedure: > As part of the 2018 E-Scooter pilot, the city publically stated data was being collected.collection_consent:
motivation_disclosure: trueconsent_protocol: 'Terms of use'ability_to_revoke_consent: falseindividual_legal_implications:
exposes_individuals_to_harm: falseexposes_individuals_to_legal_action: falsemitigation_procedure: > Only aggregated, and therefore anonymized, data was published.social_advantages:
groups_advantaged: []groups_disadvantaged: []details: ''mitigation_procedure: > Only aggregated, and therefore anonymized, data was published.privacy:
guarantees:
- Personally identifiable data would not be publishedprotections: []confidential: falsepersonally_identifiable: falseethics:
content_warnings: []worst_case_scenarios:
- Data is used to sabotage PBOT's transportation goals in order to drive more people to use private transportation methods. E-Scooters or otherwise.
- E-Scooter providers use this data to prey on vulnerable populations, such as underage young adults and children (e.g, ensuring a healthy stock of scooters outside of high schools).comments: > Since this dataset is derived from more detailed and more granular data owned by E-Scooter providers, there is nothing here that is enabling worst case scenarios beyond what these companies could do on their own already.
The text was updated successfully, but these errors were encountered:
My initial exercise of taking the Context questions and translating them into a YAML spec used out of date questions.
The final Context v1 spec should be similar in structure (prefer nested properties, short property names, mindful datatypes) but up to date with the latest methodology requirements.
Example from initial exercise, for inspiration
The text was updated successfully, but these errors were encountered: