Factored Reinforcement Learning (FRL) is a new technique to solve Factored Markov Decision Problems (FMDPs) when the structure of the problem is not known in advance. Like Anticipatory Learning Classifier Systems (ALCSs), it is a model-based Reinforcement Learning approach that includes generalization mechanisms in the presence of a structured domain. In general, FRL and ALCSs are explicit, state-anticipatory approaches that learn generalized state transition models to improve system behavior based on model-based reinforcement learning techniques. In this contribution, we highlight the conceptual similarities and differences between FRL and ALCSs, focusing on SPITI, an instance of an FRL method, on the one hand, and on the ALCSs MACS and XACS on the other hand. Though FRL systems seem to benefit from a clearer theoretical grounding, an empirical comparison between SPITI and XACS on two benchmark problems reveals that the latter scales much better than the former when some combinations of state variables do not occur. Based on this finding, we discuss the mechanisms in XACS that result in its better scalability and propose importing these mechanisms into FRL systems. © 2009 Springer Berlin Heidelberg.
Sigaud, O., Butz, M. V., Kozlova, O., & Meyer, C. (2009). Anticipatory learning classifier systems and factored reinforcement learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5499 LNAI, pp. 321–333). https://doi.org/10.1007/978-3-642-02565-5_18