Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

merge up to latest leeds master #2

Merged
merged 14 commits into from
Aug 25, 2023
Merged

Conversation

oweno-tfwm
Copy link
Owner

No description provided.

oweno-tfwm and others added 14 commits August 9, 2023 11:47
… more than 10 minutes,

which is longer than the multi threaded section before it.

changing to call data.table::rbindlist reduces this to 20 seconds or so.
In order for this to work the columns need to stay in the same order
- which we achieve by flipping the join round so that the target is on the left hand side of the output instead of the right hand side
- which means that the columns stay in the same order, so we don't need to match on names either.
- longer than the subsequent multi-threaded section.
Replacing with a transpose of the dataframe reduces this to 3s or so.
The transpose causes the named dataframe to be coerced into a unnamed vector of int, so we have to index into the vector by number, and convert the date coerced into int back into a date.
make console message clearer about what action is being carried out
…ave a NA route_type, so when loading into most GTFS databases violate non-null constraint on route_type column. While they are a valid timetabled train (mostly freight), we're not interested in them if we're doing passenger service timetabling.
Performance + Verbosity of validation + thoroughness of fixing
@oweno-tfwm oweno-tfwm merged commit 772e072 into oweno-tfwm:master Aug 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants