Optimistic AIXI

Peter Sunehag; Marcus Hutter

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7716 LNAI 312-321

DOI: 10.1007/978-3-642-35506-6_32

Abstract

We consider extending the AIXI agent by using multiple (or even a compact class of) priors. This has the benefit of weakening the conditions on the true environment that we need to prove asymptotic optimality. Furthermore, it decreases the arbitrariness of picking the prior or reference machine. We connect this to removing symmetry between accepting and rejecting bets in the rationality axiomatization of AIXI and replacing it with optimism. Optimism is often used to encourage exploration in the more restrictive Markov Decision Process setting and it alleviates the problem that AIXI (with geometric discounting) stops exploring prematurely. © 2012 Springer-Verlag.
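
As a rough illustration of the optimism described above, the two decision rules can be contrasted in symbols. This is a sketch under assumptions, not necessarily the paper's exact formulation: the notation is introduced here and does not appear in the abstract. Let $\Xi$ be a finite class of priors, $h$ the interaction history so far, and $V_{\xi}^{\pi}(h)$ the expected discounted future reward of policy $\pi$ under prior $\xi$. A single-prior agent maximizes value under one fixed $\xi$, whereas an optimistic agent maximizes value under whichever prior in the class is most favorable:

```latex
\begin{align*}
\pi^{\xi} &\in \arg\max_{\pi} V_{\xi}^{\pi}(h)
  && \text{(single prior } \xi \text{)} \\
\pi^{\circ} &\in \arg\max_{\pi} \max_{\xi \in \Xi} V_{\xi}^{\pi}(h)
  && \text{(optimistic, class } \Xi \text{)}
\end{align*}
```

Under this reading, if any prior in $\Xi$ still assigns high value to an untried course of action, the optimistic agent has a reason to try it. That is one way to see why optimism encourages exploration.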

Citation (APA)

Sunehag, P., & Hutter, M. (2012). Optimistic AIXI. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7716 LNAI, pp. 312–321). https://doi.org/10.1007/978-3-642-35506-6_32
