Spatio-temporal attention deep recurrent Q-network for POMDPs

1Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

One of the long-standing challenges for reinforcement learning agents is to deal with noisy environments. Although progress has been made in producing an agent capable of optimizing its environment in fully observable conditions, partial observability still remains a difficult task. In this paper, a novel model is proposed which inspired by human perception, utilizes two fundamental machine learning concepts, attention and memory, to better confront a noisy environment.

Cite

CITATION STYLE

APA

Etchart, M., Ladosz, P., & Mulvaney, D. (2019). Spatio-temporal attention deep recurrent Q-network for POMDPs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11804 LNAI, pp. 98–105). Springer Verlag. https://doi.org/10.1007/978-3-030-30241-2_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free