Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mar open data #1047

Merged
merged 6 commits into from
Mar 14, 2024
Merged

Mar open data #1047

merged 6 commits into from
Mar 14, 2024

Conversation

tiffanychu90
Copy link
Member

@tiffanychu90 tiffanychu90 commented Mar 14, 2024

open data

  • Add a function to either find Amtrak routes primarily in CA or outside CA, because we want to include some for HQTA
  • Swap order for how scripts are run in gtfs_funnel
  • When prepping vp_condensed, use dask.delayed instead of map_partitions
  • ticket opened 3/14/24
  • Epic - Open Data Publishing 2024 #991

segment speeds

  • run March 2024 for segment speeds
  • add back ability to rerun 1 script
  • there are more shared functions for segment speeds and rt vs schedule now for how we concatenate across dates (time_series_utils), attach route information, add operator identifiers, etc
  • TODO: break out average segment speeds (derived from segment data) from average speeds (derived from segment data, but now we should derive from rt_stop_times 1st/last stop) so it's clearer

rt vs schedule

  • run March 2024 for rt vs schedule and rt_stop_times
  • add segment_speed_utils.metrics file

performance audit

  • downloading schedule and vp ~20 min
  • vp preprocessing ~30 min + schedule data preprocessing ~5 min
  • stop segments with gtfs-segments ~24 min
  • nearest neighbors 10-12 min ~15 min (x2 for segment speeds and rt_stop_times)
  • interpolation ~10 min (x2)
  • get speeds: <1 min (x2)
  • average speeds and export: ~6 min
  • rt vs schedule trips and route: ~20-22 min
  • traffic ops: ~5 min
  • hqta: ~8-9 min

3 similar comments
@tiffanychu90 tiffanychu90 merged commit bbe0026 into main Mar 14, 2024
6 checks passed
@tiffanychu90 tiffanychu90 deleted the mar-open-data branch March 14, 2024 23:06
1 similar comment
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant