- To develop a methadology that can predict train delays.
- To select an appropriate Platform for implementing the code.
- Perform the exploratory analysis on the train delay data.
- Forecasted Data Points
- Forecasted Data Samples Overlapped over Original Samples
In the study, we proposed a novel approach that sequentially concatenates a machine learning algorithm with a statistical model, i.e. PROPHET. This approach is able to consider both network-scale spatial-temporal correlations among traffic flows and location specific traffic characteristics. Its postprocessing by the ARIMA analysis can extract traffic features from the residuals of Neural Network, hence, significantly improve the accuracy of prediction. In addition, this approach is also able to capture the sharp nonlinearity of traffic flows. In the context of using seven years long railway traffic time series data set, the case study provides numerical evidence that the predictive capacity of the proposed prophet model. It is an effective and efficient tool for delay forecasting prediction. It is noteworthy that the MSE is significantly reduced in the comparison of ARIMA. It clearly reflects the important role of network-wide spatial correlations in traffic prediction. It is beneficial if the network traffic flows are treated as a whole in the prediction. The postprocess significantly improves the accuracy of delay prediction. In the future research, the proposed approach may be applied to traffic prediction for a large dataset over a wide range of trains if data is available. It may also be compared with those preprocess machine learning algorithms.
We would like to express my special thanks of gratitude to RDSO, who facilitated this project and provided us with key points and guidance for the same.