Epic - Daskify RT segment speeds #592

Closed
20 of 21 tasks
tiffanychu90 opened this issue Dec 29, 2022 · 1 comment
Labels
epic Representing research requests - large segments of work and their dependencies gtfs-rt Work related to GTFS-Realtime

tiffanychu90 commented Dec 29, 2022

After receiving a research request, use this template to plan and track your work. Be sure to also add the appropriate project-level label to this issue (e.g., gtfs-rt, DLA).

Epic Information - Daskify RT speeds

Summary

  • Pilot the route segments concept to see how speeds are calculated at the route segment-trip level
  • Assemble all the tables needed for the pipeline and prioritize which ones can be moved to dbt first
  • RT segment speeds pipeline Google Doc

Steps

  1. Cut segments (route segments or stop segments).
  2. Spatially join vehicle positions to the segments by route-direction (use dask delayed after looping over routes within an operator) (A1).
  3. Pare down vehicle positions to just the points where a trip enters and exits a segment, and drop unusable trips (trips with < 10 min of vp data). Trips with too little information, or where trip_id is None, should be excluded at the start, before paring down (A2).
  4. Do linear referencing within segments to get distance_elapsed and time_elapsed, then calculate speeds. Save as partitioned parquets (A3).
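
For illustration only, steps 3 and 4 could be sketched in pandas roughly as follows. This is not the pipeline's actual code: the column names `timestamp`, `segment_id`, and `shape_meters` (distance along the segment, assumed to come from an earlier linear-referencing step), the function names, and the choice of meters per second are all assumptions. Only `trip_id`, `distance_elapsed`, `time_elapsed`, and the 10-minute threshold come from the steps above.

```python
import pandas as pd

MIN_TRIP_MINUTES = 10  # threshold from step 3


def drop_unusable_trips(vp: pd.DataFrame) -> pd.DataFrame:
    """Drop rows with no trip_id and trips with under 10 minutes of vp data."""
    vp = vp.dropna(subset=["trip_id"])
    times = vp.groupby("trip_id")["timestamp"].agg(["min", "max"])
    duration = times["max"] - times["min"]
    keep = duration[duration >= pd.Timedelta(minutes=MIN_TRIP_MINUTES)].index
    return vp[vp["trip_id"].isin(keep)]


def segment_speeds(vp: pd.DataFrame) -> pd.DataFrame:
    """Keep the first/last point per trip-segment and derive a speed."""
    vp = vp.sort_values("timestamp")
    out = (
        vp.groupby(["trip_id", "segment_id"])
        .agg(
            enter_time=("timestamp", "first"),
            exit_time=("timestamp", "last"),
            enter_meters=("shape_meters", "first"),  # hypothetical column
            exit_meters=("shape_meters", "last"),
        )
        .reset_index()
    )
    out["time_elapsed"] = (out["exit_time"] - out["enter_time"]).dt.total_seconds()
    out["distance_elapsed"] = out["exit_meters"] - out["enter_meters"]
    # A single point in a segment gives zero elapsed time, so no speed.
    out = out[out["time_elapsed"] > 0].copy()
    out["speed_mps"] = out["distance_elapsed"] / out["time_elapsed"]
    return out


# Made-up vehicle positions: trip "a" spans 12 minutes, trip "b" only 5,
# and one row has no trip_id.
vp = pd.DataFrame(
    {
        "trip_id": ["a", "a", "a", "a", "b", "b", None],
        "segment_id": ["s1", "s1", "s2", "s2", "s1", "s1", "s1"],
        "timestamp": pd.to_datetime(
            [
                "2023-01-18 08:00:00",
                "2023-01-18 08:05:00",
                "2023-01-18 08:06:00",
                "2023-01-18 08:12:00",
                "2023-01-18 09:00:00",
                "2023-01-18 09:05:00",
                "2023-01-18 10:00:00",
            ]
        ),
        "shape_meters": [0.0, 1500.0, 50.0, 1850.0, 0.0, 900.0, 10.0],
    }
)

speeds = segment_speeds(drop_unusable_trips(vp))
print(speeds[["trip_id", "segment_id", "distance_elapsed", "time_elapsed", "speed_mps"]])
```

Filtering unusable trips before the groupby mirrors the note in step 3 about excluding them at the start. In a Dask version, a function like `segment_speeds` might be wrapped with `dask.delayed` per operator, and the result written with `DataFrame.to_parquet(..., partition_cols=[...])`, though the partitioning column is not specified in this issue.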

Issues

Set up tables for exploration

Diagnostics

Refine Data Processing

@tiffanychu90 tiffanychu90 added the epic Representing research requests - large segments of work and their dependencies label Dec 29, 2022
@tiffanychu90 tiffanychu90 added the gtfs-rt Work related to GTFS-Realtime label Jan 11, 2023
@tiffanychu90
Member Author

The speeds-by-segment analytics pipeline takes roughly 1.5 hrs to create the 2 open data layers related to speeds.
Next: use new epic for road segments.
