Proximity-based non-uniform abstractions for approximate planning

Abstract

In a deterministic world, a planning agent can be certain of the consequences of its planned sequence of actions. Not so, however, in dynamic, stochastic domains where Markov decision processes are commonly used. Unfortunately these suffer from the 'curse of dimensionality': if the state space is a Cartesian product of many small sets ('dimensions'), planning is exponential in the number of those dimensions. Our new technique exploits the intuitive strategy of selectively ignoring various dimensions in different parts of the state space. The resulting non-uniformity has strong implications, since the approximation is no longer Markovian, requiring the use of a modified planner. We also use a spatial and temporal proximity measure, which responds to continued planning as well as movement of the agent through the state space, to dynamically adapt the abstraction as planning progresses. We present qualitative and quantitative results across a range of experimental domains showing that an agent exploiting this novel approximation method successfully finds solutions to the planning problem using much less than the full state space. We assess and analyse the features of domains which our method can exploit. © 2012 AI Access Foundation.
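
As a rough illustration of the idea of ignoring dimensions in some parts of the state space, the sketch below builds a small Cartesian-product state space and collapses it with a non-uniform abstraction. It is not the authors' algorithm: the toy domain, the `abstract` function, and the use of Manhattan distance from the agent as a stand-in for the proximity measure are all assumptions made for this example.

```python
from itertools import product

# Hypothetical toy domain: every dimension is a small finite set, and the
# full state space is their Cartesian product.
dimensions = {
    "x": range(4),
    "y": range(4),
    "door_open": (False, True),
    "has_key": (False, True),
}

def abstract(state, agent_pos, radius=1):
    """Map a concrete state to an abstract state (assumed rule, for illustration).

    Close to the agent, all dimensions are kept. Further away, the two
    boolean dimensions are ignored (replaced by None), so concrete states
    that differ only in those dimensions collapse into one abstract state.
    """
    x, y, door_open, has_key = state
    distance = abs(x - agent_pos[0]) + abs(y - agent_pos[1])
    if distance <= radius:
        return (x, y, door_open, has_key)
    return (x, y, None, None)

# Enumerate the full state space: its size is the product of the dimension sizes.
states = list(product(*dimensions.values()))

# Collapse it under the non-uniform abstraction with the agent at (0, 0).
abstract_states = {abstract(s, agent_pos=(0, 0)) for s in states}

print(len(states), "concrete states,", len(abstract_states), "abstract states")
```

Because the level of detail depends on where a state lies, a planner working over `abstract_states` sees a much smaller space, at the cost of the approximation no longer being Markovian, as the abstract notes. Moving `agent_pos` changes which region keeps full detail, which is the kind of adaptation a proximity measure would drive.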

Cite

Baum, J., Nicholson, A. E., & Dix, T. I. (2012). Proximity-based non-uniform abstractions for approximate planning. Journal of Artificial Intelligence Research, 43, 477–522. https://doi.org/10.1613/jair.3414
