Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aggregating Metrics on the Operator Grain #1259

Merged
merged 5 commits into from
Oct 17, 2024
Merged

Aggregating Metrics on the Operator Grain #1259

merged 5 commits into from
Oct 17, 2024

Conversation

amandaha8
Copy link
Contributor

  • Calculating some of the metrics on the operator grain starting from vp_trip_metrics.
  • Incorporated changes into scripts/MAKEFILE.
  • Using this new dataset to create the charts within the GTFS Digest portfolio.

@amandaha8 amandaha8 merged commit 3369dd9 into main Oct 17, 2024
3 checks passed
@amandaha8 amandaha8 deleted the ah_gtfs branch October 17, 2024 22:02
@tiffanychu90
Copy link
Member

Can you make a follow-up PR that handles this:

In gtfs_analytics_data.yml: there are a lot many operator grains being created in merge_operator_data.py.

operator_metrics should be not be a 3rd one added, it should belong in one of the first 2.

 operator_profiles: "digest/operator_profiles"
 operator_sched_rt: "digest/operator_schedule_rt_category"
 operator_metrics: "digest/operator_metrics"

These can be consolidated in merge_operator_data.py. It should follow the pattern in merge_data.py, where these are the steps:

  1. concatenate_schedule_by_route_direction (grab all dates for schedule by route-dir grain)
  2. concatenate_speeds_by_route_direction (grab all dates for speeds by route-dir grain)
  3. concatenate_rt_vs_schedule_by_route_direction (grab all dates for rt vs schedule by route-dir grain)
  4. concatenate_crosswalk_organization (grab all dates for crosswalk by operator)
  5. merge_data_sources_by_route_direction (this is essentially a 1:1 merge of [1, 2, 3] and a m:1 merge to [4].

In your follow-up PR, the operator grain should replicate the relevant portions of that. The operator files in gtfs_analytics_data.yml should only show unique grains.

  • operator_profiles (operator grain)
  • operator_metrics (operator grain, should be worked into profiles)
  • operator_sched_rt_category (check the grain, it might be able to be worked into the above too)
  • operator_routes_map (operator-route grain)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants