Temporal shift reinforcement learning

Deepak George Thomas; Tichakorn Wongpiromsarn; Ali Jannesari

Conference Proceedings, Open Access

EuroMLSys 2022 - Proceedings of the 2nd European Workshop on Machine Learning and Systems (2022) 95-100

DOI: 10.1145/3517207.3526968

Abstract

The function approximators employed by traditional image-based Deep Reinforcement Learning (DRL) algorithms usually lack a temporal learning component and instead focus on learning the spatial component. We propose a technique, Temporal Shift Reinforcement Learning (TSRL), in which the temporal and spatial components are learned jointly. Moreover, TSRL does not require additional parameters to perform temporal learning. We show that TSRL outperforms the commonly used frame stacking heuristic on all of the Atari environments we test on, while beating the state of the art (SOTA) on all but one of them. This investigation has implications for robotics as well as sequential decision-making domains. Our code is available at https://github.com/Deepakgthomas/TSM-RL
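The abstract does not spell out how temporal learning is obtained without extra parameters. One plausible reading, suggested by the repository name (TSM-RL), is a temporal shift over feature maps: a fraction of the channels is moved one step along the time axis so that each frame's features mix with those of its neighbours. The sketch below illustrates that idea only, under stated assumptions; the function name `temporal_shift`, the `fold_div` parameter, and the `(T, C, H, W)` layout are hypothetical and are not taken from the authors' code.

```python
import numpy as np


def temporal_shift(frames: np.ndarray, fold_div: int = 8) -> np.ndarray:
    """Shift a fraction of channels along the time axis (illustrative sketch).

    frames:   array of shape (T, C, H, W), T consecutive feature maps.
    fold_div: hypothetical knob; C // fold_div channels are shifted in
              each temporal direction, the rest are left untouched.
    """
    t, c, h, w = frames.shape
    fold = c // fold_div
    out = np.zeros_like(frames)

    # First `fold` channels: each frame receives the NEXT frame's features.
    # The last frame has no successor, so those channels stay zero.
    out[:-1, :fold] = frames[1:, :fold]

    # Next `fold` channels: each frame receives the PREVIOUS frame's features.
    # The first frame has no predecessor, so those channels stay zero.
    out[1:, fold:2 * fold] = frames[:-1, fold:2 * fold]

    # Remaining channels are copied unchanged (purely spatial information).
    out[:, 2 * fold:] = frames[:, 2 * fold:]
    return out


if __name__ == "__main__":
    # Four frames of 8-channel, 1x1 feature maps, just to show the data flow.
    demo = np.arange(32, dtype=np.float32).reshape(4, 8, 1, 1)
    shifted = temporal_shift(demo, fold_div=4)
    print(shifted[:, :, 0, 0])
```

The shift is pure indexing and copying, so it introduces no learnable weights, which is consistent with the abstract's claim that no additional parameters are required. Frame stacking, by contrast, concatenates consecutive frames along the channel axis at the input.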

Cite (APA)

Thomas, D. G., Wongpiromsarn, T., & Jannesari, A. (2022). Temporal shift reinforcement learning. In EuroMLSys 2022 - Proceedings of the 2nd European Workshop on Machine Learning and Systems (pp. 95–100). Association for Computing Machinery, Inc. https://doi.org/10.1145/3517207.3526968
