Abstract
The rapid growth of urban metro systems requires novel strategies to guarantee operational dependability and energy efficiency. This article describes a new way to use deep reinforcement learning (DRL) to help metro networks with predictive maintenance that adapts to changing conditions and energy optimization. We used real-world transit data from the General Transit Feed Specification (GTFS) to model the maintenance scheduling and energy management problem as a Markov Decision Process. This included important operational metrics like peak-hour demand, train arrival times, and station stop densities. A custom reinforcement learning environment mimics the changing conditions of metro operations. Deep Q-Networks (DQNs) and Proximal Policy Optimization (PPO) sophisticated deep reinforcement learning techniques were used to identify the optimal policies for decreasing energy consumption and downtime. The PPO hyperparameters were additionally optimized using Bayesian optimization by implementing Optuna, which produces a far greater performance than baseline DQNs and basic PPO. Comparative tests showed that our improved DRL-based method improves the accuracy of predictive maintenance and the efficiency of energy use, which lowers operational costs and raises the dependability of the service. These results show that advanced learning and optimization techniques could be added to public transportation systems in cities. This could lead to more sustainable and smart transportation management in big cities.
Author supplied keywords
Cite
CITATION STYLE
Rziki, M. H., Hadbi, A. E., Boutahir, M. K., & Abounaima, M. C. (2025). Adaptive Predictive Maintenance and Energy Optimization in Metro Systems Using Deep Reinforcement Learning. Sustainability (Switzerland), 17(11). https://doi.org/10.3390/su17115096
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.