Representation discovery for MDPs using bisimulation metrics

4Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

We provide a novel, flexible, iterative refinement algorithm to automatically construct an approximate statespace representation for Markov Decision Processes (MDPs). Our approach leverages bisimulation metrics, which have been used in prior work to generate features to represent the state space of MDPs. We address a drawback of this approach, which is the expensive computation of the bisimulation metrics. We propose an algorithm to generate an iteratively improving sequence of state space partitions. Partial metric computations guide the representation search and provide much lower space and computational complexity, while maintaining strong convergence properties. We provide theoretical results guaranteeing convergence as well as experimental illustrations of the accuracy and savings (in time and memory usage) of the new algorithm, compared to traditional bisimulation metric computation.

Cite

CITATION STYLE

APA

Ruan, S. S., Comanici, G., Panangaden, P., & Precup, D. (2015). Representation discovery for MDPs using bisimulation metrics. In Proceedings of the National Conference on Artificial Intelligence (Vol. 5, pp. 3578–3584). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9747

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free