Spatio-temporal attention deep recurrent Q-network for POMDPs

Mariano Etchart; Pawel Ladosz; David Mulvaney

Conference Proceedings

Spatio-temporal attention deep recurrent Q-network for POMDPs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11804 LNAI 98-105

DOI: 10.1007/978-3-030-30241-2_9

1Citations

4Readers

Get full text

Abstract

One of the long-standing challenges for reinforcement learning agents is to deal with noisy environments. Although progress has been made in producing an agent capable of optimizing its environment in fully observable conditions, partial observability still remains a difficult task. In this paper, a novel model is proposed which inspired by human perception, utilizes two fundamental machine learning concepts, attention and memory, to better confront a noisy environment.

Cite

CITATION STYLE

APA

Etchart, M., Ladosz, P., & Mulvaney, D. (2019). Spatio-temporal attention deep recurrent Q-network for POMDPs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11804 LNAI, pp. 98–105). Springer Verlag. https://doi.org/10.1007/978-3-030-30241-2_9

Spatio-temporal attention deep recurrent Q-network for POMDPs

Abstract

Cite

Register to see more suggestions