Abstract
Train delay is a critical problem in railway systems. A previous prediction of delays is a critical issue advantageous for passengers to re-plan their journeys more reliably. It is also essential for railway operators to control the feasibility of timetable realization for more efficient train schedules. This paper aims to present a novel two-level Light Gradient Boosting Machine (LightGBM) approach that combines classification and regression in a hybrid model. It was proposed to predict passenger train delays on the Tunisian railway. The first level indicates the class of delay, where the delays are divided into intervals of 5 minutes ([0,5], [6,10], …, [>60]), 13 classes in total were obtained. The second level then predicts the actual delay in minutes, considering the expected delay class at the first level. This model was trained and tested based on the historical data of train operation collected by the Tunisian National Railways Company (SNCFT) and infrastructure characteristics. Our methodology consists of the following phases: data collection, data cleaning, complete data analysis, feature engineering, modeling and evaluation. The obtained results indicate that the two-level approach based on the LightGBM model outperforms the one-level method. It also outperformed the benchmark models.
Author supplied keywords
Cite
CITATION STYLE
Laifa, H., Khcherif, R., & Ghezala, H. B. (2022). Predicting Trains Delays using a Two-level Machine Learning Approach. In International Conference on Agents and Artificial Intelligence (Vol. 3, pp. 737–744). Science and Technology Publications, Lda. https://doi.org/10.5220/0010898300003116
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.