A parallel algorithm for POMDP solution

Larry D. Pyeatt; Adele E. Howe

Conference Proceedings

A parallel algorithm for POMDP solution

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2000) 1809 73-83

DOI: 10.1007/10720246_6

2Citations

5Readers

Get full text

Abstract

Most exact algorithms for solving partially observable Mar-kov decision processes (POMDPs) are based on a form of dynamic pro-gramming in which a piecewise-linear and convex representation of the value function is updated at every iteration to more accurately appro-ximate the true value function. However, the process is computationally expensive, thus limiting the practical application of POMDPs in plan-ning. To address this current limitation, we present a parallel distribu-ted algorithm based on the Restricted Region method proposed by Cas-sandra, Littman and Zhang [1]. We compare performance of the parallel algorithm against a serial implementation Restricted Region.

Cite

CITATION STYLE

APA

Pyeatt, L. D., & Howe, A. E. (2000). A parallel algorithm for POMDP solution. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 1809, pp. 73–83). Springer Verlag. https://doi.org/10.1007/10720246_6

A parallel algorithm for POMDP solution

Abstract

Cite

Register to see more suggestions