One of the long-standing challenges for reinforcement learning agents is dealing with noisy environments. Although progress has been made in producing agents capable of optimizing their behaviour in fully observable conditions, partial observability remains a difficult problem. In this paper, a novel model is proposed which, inspired by human perception, uses two fundamental machine learning concepts, attention and memory, to better cope with a noisy environment.
Etchart, M., Ladosz, P., & Mulvaney, D. (2019). Spatio-temporal attention deep recurrent Q-network for POMDPs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11804 LNAI, pp. 98–105). Springer Verlag. https://doi.org/10.1007/978-3-030-30241-2_9