Predicting Trains Delays using a Two-level Machine Learning Approach


Abstract

Train delays are a critical problem in railway systems. Predicting delays in advance lets passengers re-plan their journeys more reliably, and it lets railway operators check whether a timetable is feasible and schedule trains more efficiently. This paper presents a novel two-level Light Gradient Boosting Machine (LightGBM) approach that combines classification and regression in a hybrid model, proposed for predicting passenger train delays on the Tunisian railway. The first level predicts the delay class: delays are divided into 5-minute intervals ([0,5], [6,10], …, [>60]), giving 13 classes in total. The second level then predicts the actual delay in minutes, taking into account the delay class predicted at the first level. The model was trained and tested on historical train operation data collected by the Tunisian National Railways Company (SNCFT), together with infrastructure characteristics. The methodology consists of the following phases: data collection, data cleaning, comprehensive data analysis, feature engineering, modeling, and evaluation. The results indicate that the two-level LightGBM approach outperforms the one-level method as well as the benchmark models.
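
The two-level idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes delays are whole, non-negative minutes and that the 13 classes are twelve 5-minute bins up to 60 minutes plus one class for delays above 60. The names `delay_class` and `two_level_predict` are hypothetical, as is the choice to pass the predicted class to the regressor as an extra feature, since the abstract does not say how the second level uses it. The `classify` and `regress` arguments stand in for trained models such as LightGBM ones.

```python
from typing import Callable, List, Sequence

N_CLASSES = 13  # assumed: twelve 5-minute bins up to 60 minutes, plus one ">60" class


def delay_class(delay_minutes: int) -> int:
    """Map a non-negative delay in whole minutes to a class index 0..12."""
    if delay_minutes < 0:
        raise ValueError("delay must be non-negative")
    if delay_minutes > 60:
        return N_CLASSES - 1
    # [0,5] -> 0, [6,10] -> 1, ..., [56,60] -> 11
    return max(0, (delay_minutes - 1) // 5)


def two_level_predict(
    features: Sequence[float],
    classify: Callable[[Sequence[float]], int],
    regress: Callable[[List[float]], float],
) -> float:
    """Level 1 predicts a delay class; level 2 predicts minutes given that class."""
    predicted_class = classify(features)
    # Assumption: the predicted class is appended to the feature vector.
    augmented = list(features) + [float(predicted_class)]
    return regress(augmented)
```

When training the first level, `delay_class` would be applied to the observed delays to build the classification labels, while the second level would be fitted on the raw delay in minutes.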

Cite

Laifa, H., Khcherif, R., & Ghezala, H. B. (2022). Predicting Trains Delays using a Two-level Machine Learning Approach. In International Conference on Agents and Artificial Intelligence (Vol. 3, pp. 737–744). Science and Technology Publications, Lda. https://doi.org/10.5220/0010898300003116
