A parallel algorithm for POMDP solution

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Most exact algorithms for solving partially observable Mar-kov decision processes (POMDPs) are based on a form of dynamic pro-gramming in which a piecewise-linear and convex representation of the value function is updated at every iteration to more accurately appro-ximate the true value function. However, the process is computationally expensive, thus limiting the practical application of POMDPs in plan-ning. To address this current limitation, we present a parallel distribu-ted algorithm based on the Restricted Region method proposed by Cas-sandra, Littman and Zhang [1]. We compare performance of the parallel algorithm against a serial implementation Restricted Region.

Cite

CITATION STYLE

APA

Pyeatt, L. D., & Howe, A. E. (2000). A parallel algorithm for POMDP solution. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 1809, pp. 73–83). Springer Verlag. https://doi.org/10.1007/10720246_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free