Deep Adversarial Reinforcement Learning with Noise Compensation by Autoencoder

Kohei Ohashi; Kosuke Nakanishi; Wataru Sasaki; Yuji Yasui; Shin Ishii

Journal ArticleOPEN ACCESS

Deep Adversarial Reinforcement Learning with Noise Compensation by Autoencoder

IEEE Access (2021) 9 143901-143912

DOI: 10.1109/ACCESS.2021.3121751

6Citations

13Readers

Abstract

We present a new adversarial learning method for deep reinforcement learning (DRL). Based on this method, robust internal representation in a deep Q-network (DQN) was introduced by applying adversarial noise to disturb the DQN policy; however, it was compensated for by the autoencoder network. In particular, we proposed the use of a new type of adversarial noise: it encourages the policy to choose the worst action leading to the worst outcome at each state. When the proposed method, called deep Q-W-network regularized with an autoencoder (DQWAE), was applied to seven different games in an Atari 2600, the results were convincing. DQWAE exhibited greater robustness against the random/adversarial noise added to the input and accelerated the learning process more than the baseline DQN. When applied to a realistic automatic driving simulation, the proposed DRL method was found to be effective at rendering the acquired policy robust against random/adversarial noise.

Author supplied keywords

Cite

CITATION STYLE

APA

Ohashi, K., Nakanishi, K., Sasaki, W., Yasui, Y., & Ishii, S. (2021). Deep Adversarial Reinforcement Learning with Noise Compensation by Autoencoder. IEEE Access, 9, 143901–143912. https://doi.org/10.1109/ACCESS.2021.3121751

Deep Adversarial Reinforcement Learning with Noise Compensation by Autoencoder

Abstract

Author supplied keywords

Cite

Register to see more suggestions