ARKAQ-Learning: Autonomous state space segmentation and policy generation

Alp Sardaǧ; H. Levent Akin

Conference Proceedings

ARKAQ-Learning: Autonomous state space segmentation and policy generation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3733 LNCS 512-523

DOI: 10.1007/11569596_54

1Citations

5Readers

Get full text

Abstract

A real world environment is often partially observable by the agents either because of noisy sensors or incomplete perception. Autonomous strategy planning under uncertainty has two major challenges. First, autonomous segmentation of the state space for a given task; Second, emerging complex behaviors that deal with each state segment. This paper suggests a new approach that handles both by utilizing combination of various techniques, namely ARKAQ-Learning (ART 2-A networks augmented with Kalman Filters and Q-Learning). The algorithm is an online algorithm and it has low space and computational complexity. The algorithm was run for some well known partially observable Markov decision process problems. World Model Generator could reveal the hidden states, mapping non-Markovian model to Markovian internal state space. Policy Generator could build the optimal policy on the internal Markovian state model. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Sardaǧ, A., & Akin, H. L. (2005). ARKAQ-Learning: Autonomous state space segmentation and policy generation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3733 LNCS, pp. 512–523). https://doi.org/10.1007/11569596_54

ARKAQ-Learning: Autonomous state space segmentation and policy generation

Abstract

Cite

Register to see more suggestions