MetaLight: Value-based meta-reinforcement learning for traffic signal control

Xinshi Zang; Huaxiu Yao; Guanjie Zheng; Nan Xu; Kai Xu; Zhenhui Li

Conference Proceedings (Open Access)

AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (2020) 1153-1160

DOI: 10.1609/aaai.v34i01.5467

121 Citations

129 Readers

Abstract

Using reinforcement learning for traffic signal control has attracted increasing interest recently. Various value-based reinforcement learning methods have been proposed to deal with this classical transportation problem and have achieved better performance than traditional transportation methods. However, current reinforcement learning models rely on tremendous amounts of training data and computational resources, which may have bad consequences (e.g., traffic jams or accidents) in the real world. In traffic signal control, some algorithms have been proposed to enable quick learning from scratch, but little attention has been paid to learning by transferring and reusing learned experience. In this paper, we propose a novel framework, named MetaLight, to speed up the learning process in new scenarios by leveraging the knowledge learned from existing scenarios. MetaLight is a value-based meta-reinforcement learning workflow, based on the representative gradient-based meta-learning algorithm MAML, that periodically alternates between individual-level adaptation and global-level adaptation. Moreover, MetaLight improves the state-of-the-art reinforcement learning model FRAP in traffic signal control by optimizing its model structure and updating paradigm. The experiments on four real-world datasets show that our proposed MetaLight not only adapts more quickly and stably in new traffic scenarios, but also achieves better performance.
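The alternation between individual-level and global-level adaptation mentioned in the abstract can be illustrated with a toy, first-order MAML-style loop. This is only a sketch under strong assumptions and is not the MetaLight implementation: a quadratic loss stands in for the value-based RL loss, each "scenario" is reduced to a hypothetical target vector, and the names `adapt`, `meta_train`, `alpha`, and `beta` are illustrative choices rather than anything taken from the paper.

```python
# Toy first-order MAML-style loop (illustrative only, NOT MetaLight itself).
# Assumption: each task is a target vector c, with loss 0.5 * ||theta - c||^2
# standing in for the value-based RL loss used in traffic signal control.

def grad(theta, target):
    """Gradient of 0.5 * ||theta - target||^2 with respect to theta."""
    return [t - c for t, c in zip(theta, target)]

def sgd_step(theta, g, lr):
    """One plain gradient-descent step."""
    return [t - lr * gi for t, gi in zip(theta, g)]

def adapt(theta0, target, alpha, steps=1):
    """Individual-level adaptation: a few gradient steps on one task,
    starting from the shared initialization theta0."""
    theta = list(theta0)
    for _ in range(steps):
        theta = sgd_step(theta, grad(theta, target), alpha)
    return theta

def meta_train(tasks, theta0, alpha=0.1, beta=0.05, rounds=200):
    """Alternate individual-level and global-level updates."""
    theta0 = list(theta0)
    for _ in range(rounds):
        meta_g = [0.0] * len(theta0)
        for target in tasks:
            theta_i = adapt(theta0, target, alpha)
            # First-order approximation: use the gradient at the adapted
            # parameters and ignore second-order terms.
            g = grad(theta_i, target)
            meta_g = [m + gi for m, gi in zip(meta_g, g)]
        # Global-level adaptation: update the shared initialization.
        theta0 = sgd_step(theta0, meta_g, beta)
    return theta0

# Four hypothetical "scenarios"; the learned initialization ends up at
# their centre, from which one step moves toward any single task.
tasks = [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]]
init = meta_train(tasks, theta0=[5.0, -3.0])
```

Calling `adapt(init, [2.0, 2.0], 0.1)` then performs the quick adaptation to a single scenario; in this toy setting the shared initialization settles near the mean of the task targets.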

Cite

Zang, X., Yao, H., Zheng, G., Xu, N., Xu, K., & Li, Z. (2020). MetaLight: Value-based meta-reinforcement learning for traffic signal control. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 1153–1160). AAAI Press. https://doi.org/10.1609/aaai.v34i01.5467
