Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge changes from TfWM v4 #70

Open
wants to merge 87 commits into
base: oweno-tfwm
Choose a base branch
from

Conversation

mem48
Copy link
Contributor

@mem48 mem48 commented Oct 3, 2024

No description provided.

oweno-tfwm and others added 30 commits August 15, 2023 21:40
… columns not being explicitly set - and some fields assumed to be present in route table (route_desc) are not present in BODS data
…quencies' and 'feed_info' when reading / writing BODS data.

Removed assumptions about table names, all additional tables encountered in zip are read and written.

also performance improvement when writing.
…ay anything else that it wasn't expecting. OK for the GTFS tables generated by this tool from CIF files, but caused data loss when using BODS data.

modified to keep all encountered columns and tables - passes through if there isn't any specific code for the table in question.
renamed ZZ to reflect it's a generic obfuscated freight, not a specific operator
…lendar_date exists with no calendar (rare but valid data configuration)
merge up to latest leeds master
1) make load and merge agnostic about expected tables and fields, so that optional tables and fields are at least passed through rather than discarded. (allows GTFS generated by this tool and BODS to be merged)
2) tighten up data type expectations for fields in BODS data
3) fix bugs when calendar dates have no calendar (this is a legitimate but range arrangement of data)
4) performance improvement while formatting files before writing to file system (3 minute improvement when writing 400mb zipped GTFS file (all GB BODS extract)

There is one line in merge that needs resolution - complex line that I couldn't understand so commented it out.

calendar_dates <- calendar_dates[sapply(calendar_dates, function(x){ifelse(is.null(nrow(x)),0,nrow(x))}) > 0]

no idea what it's supposed to do and no explanatory comment
…o the caller being a different shaped structure being passed back
change to use data.table (likely a breaking change)
performance improvements
add setting of route_long_name "Train from..." to change based on mode
…bles, change default options to highest performing write combination
…as also been extensively profiled and reasonable level of optimisation applied
…of date+time

(this means if github can't be reached should only nag once per day to update instead of on every single package load)
2) add timeout to update check on package load and make it short (but make default when explicitly called long)
(atoc data doesn't seem to have many non-public moves in it to start with)
…t completely follow the schema.

Also add 'metro' to long name for underground & Tyne&Wear metro services
…ass time

2) when looking at non-public services include passes marked as 'no pickup/dropoff available' from a GTFS perspective
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants