MetaLight: Value-based meta-reinforcement learning for traffic signal control

Abstract

Using reinforcement learning for traffic signal control has attracted increasing interest recently. Various value-based reinforcement learning methods have been proposed to deal with this classical transportation problem and have achieved better performance than traditional transportation methods. However, current reinforcement learning models rely on large amounts of training data and computational resources, which may lead to harmful consequences (e.g., traffic jams or accidents) in the real world. In traffic signal control, some algorithms have been proposed to enable quick learning from scratch, but little attention has been paid to learning by transferring and reusing learned experience. In this paper, we propose a novel framework, named MetaLight, to speed up the learning process in new scenarios by leveraging the knowledge learned from existing scenarios. MetaLight is a value-based meta-reinforcement learning workflow based on the representative gradient-based meta-learning algorithm MAML, and it periodically alternates between individual-level adaptation and global-level adaptation. Moreover, MetaLight improves the state-of-the-art reinforcement learning model for traffic signal control, FRAP, by optimizing its model structure and updating paradigm. Experiments on four real-world datasets show that the proposed MetaLight not only adapts more quickly and stably in new traffic scenarios, but also achieves better performance.
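
The alternation between individual-level and global-level adaptation mentioned in the abstract can be illustrated with a minimal MAML-style sketch. This is not the paper's implementation: the one-dimensional quadratic "tasks" stand in for traffic scenarios, and the names task_grad, individual_adapt, meta_train, alpha, beta and rounds are hypothetical, chosen only for this illustration. No Q-network, FRAP model or traffic simulator is involved.

```python
from typing import List


def task_grad(theta: float, center: float) -> float:
    """Gradient of the toy task loss (theta - center)**2 with respect to theta."""
    return 2.0 * (theta - center)


def individual_adapt(theta: float, center: float, alpha: float) -> float:
    """Individual-level step: one gradient step on a single task,
    starting from the shared initialization theta."""
    return theta - alpha * task_grad(theta, center)


def meta_train(centers: List[float], alpha: float = 0.1,
               beta: float = 0.05, rounds: int = 200) -> float:
    """Alternate per-task adaptation with a global update of the shared
    initialization (assumed toy setup, not the MetaLight training loop)."""
    theta = 0.0
    for _ in range(rounds):
        meta_grad = 0.0
        for center in centers:
            # Individual-level adaptation on this task.
            theta_i = individual_adapt(theta, center, alpha)
            # Global-level gradient: differentiate the post-adaptation loss
            # through the inner step. For this quadratic loss,
            # d(theta_i)/d(theta) = 1 - 2 * alpha.
            meta_grad += task_grad(theta_i, center) * (1.0 - 2.0 * alpha)
        # Global-level adaptation of the shared initialization.
        theta -= beta * meta_grad / len(centers)
    return theta


centers = [1.0, 2.0, 3.0, 6.0]   # hypothetical task optima
theta0 = meta_train(centers)      # learned shared initialization
new_task = 4.0                    # hypothetical unseen task
adapted = individual_adapt(theta0, new_task, alpha=0.1)
```

In this toy setting the shared initialization settles near the mean of the task optima, so a single individual-level step on an unseen task starts from a point that is already close to it. Whether that intuition carries over to value-based traffic signal control is the subject of the paper itself.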

Cite

Zang, X., Yao, H., Zheng, G., Xu, N., Xu, K., & Li, Z. (2020). MetaLight: Value-based meta-reinforcement learning for traffic signal control. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 1153–1160). AAAI Press. https://doi.org/10.1609/aaai.v34i01.5467
