Passenger Flow Forecasting in Public Transportation Networks Under Event Conditions

University of Twente Master's thesis: essay.utwente.nl/97536/

This GitHub repository contains the implementations of the passenger flow forecasting algorithms explored in a Computer Science Master's thesis at the University of Twente, written in collaboration with Info Support.

The code is copyrighted by Info Support B.V. in the Netherlands and published under the Apache-2.0 license.

Why?

Public transportation networks are an essential part of public infrastructure and have the potential to reduce society's dependency on personal cars. Making public transportation a more attractive option requires better passenger demand forecasts, as these can be used to guarantee enough seating capacity for everyone, especially under event conditions (such as concerts, festivals, and sports).

Even though passenger demand in public transportation tends to be quite regular, accurately forecasting the additional passenger peaks caused by large events has proven challenging. This research attempts to forecast these peaks in additional demand from a few details about the events. The results did not confirm the hypothesis, but some positive results suggest potential for future work with access to better-quality data. Read the thesis for more details: essay.utwente.nl/97536/

Figure: Visualized passenger flow throughout the day.

Running the code

Repository structure

The repository is structured as follows:

models/ contains the implementations of the various Machine Learning algorithms.
train/ contains the Jupyter notebooks used to train the Machine Learning algorithms.
analysis.ipynb is used for general data analysis and to determine the SARIMA order.
eventsize.ipynb is used to visualize the relationship between the capacity of an event's venue and the additional passengers in public transit (Section 4 of the thesis).
generate-results.ipynb uses the trained ML models to generate forecasts up to 72 hours into the future.
forecasting-results.ipynb visualizes the performance of the ML algorithms in tables and figures.

Any files related to the private NS dataset are omitted from this code repository.

Getting started

The steps below describe what is needed to get started with the code in this repository:

1. Create a Poetry virtual environment based on Python 3.11 and install the dependencies from pyproject.toml.
2. Manually install PyTorch and PyG (PyTorch Geometric) in the virtual environment.
3. Download the hourly origin-destination (OD) ridership data from the BART website (https://www.bart.gov/about/reports/ridership) and save it as data/bart-od-{year}.csv.gz.
4. Parse the OD matrices and convert them to edge-level passenger flow data using read_bart(...) and compute_flow(...) from util/graph.py.
5. Train the ML models with the Jupyter notebooks in train/.
6. Evaluate the results with generate-results.ipynb and forecasting-results.ipynb.
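
To make the conversion from OD matrices to edge-level flow more concrete, here is a minimal, self-contained sketch of the idea. It is not the code in util/graph.py: the helper names (read_od_rows, shortest_path, od_to_edge_flow), the assumed column layout (date, hour, origin, destination, count, without a header row), the fewest-hops routing, and the choice to book every trip in its entry hour are all assumptions made for illustration. The station names and counts are made-up toy values.

```python
import csv
import gzip
import io
from collections import defaultdict, deque


def read_od_rows(fileobj):
    """Yield (date, hour, origin, destination, count) from an OD CSV stream.

    Assumes five columns and no header row; adjust if the real file differs.
    """
    for row in csv.reader(fileobj):
        if len(row) != 5:
            continue  # skip blank or malformed lines
        date, hour, origin, dest, count = row
        yield date, int(hour), origin, dest, int(count)


def shortest_path(adjacency, source, target):
    """Breadth-first search for a fewest-hops path; returns a station list or None."""
    previous = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbour in adjacency.get(node, ()):
            if neighbour not in previous:
                previous[neighbour] = node
                queue.append(neighbour)
    return None


def od_to_edge_flow(rows, adjacency):
    """Add each OD count to every directed edge on its routed path.

    Simplification: all passengers are booked in the hour they entered,
    so travel time between stations is ignored.
    """
    flow = defaultdict(int)
    for date, hour, origin, dest, count in rows:
        path = shortest_path(adjacency, origin, dest)
        if path is None:
            continue  # no route between the two stations
        for u, v in zip(path, path[1:]):
            flow[(date, hour, u, v)] += count
    return dict(flow)


# Toy network A - B - C and a toy gzip-compressed OD file held in memory.
TOY_ADJACENCY = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}
toy_csv = "2019-01-01,8,A,C,10\n2019-01-01,8,B,C,5\n2019-01-01,8,C,A,2\n"
buffer = io.BytesIO(gzip.compress(toy_csv.encode("utf-8")))

with gzip.open(buffer, mode="rt", newline="") as handle:
    edge_flow = od_to_edge_flow(read_od_rows(handle), TOY_ADJACENCY)

for key in sorted(edge_flow):
    print(key, edge_flow[key])
```

In the toy example, the 10 passengers travelling from A to C and the 5 travelling from B to C both use the edge B to C, so that edge carries 15 passengers in hour 8. To try the sketch on a downloaded file, pass a path such as data/bart-od-2019.csv.gz to gzip.open instead of the in-memory buffer and supply the real station adjacency; whether the file matches the assumed column layout should be checked first.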

GitHub repository: jeffreybakker/passenger-flow-forecasting