Restricted value iteration: Theory and algorithms

Weihong Zhang; Nevin L. Zhang

Journal ArticleOPEN ACCESS

Restricted value iteration: Theory and algorithms

Journal of Artificial Intelligence Research (2005) 23 123-165

DOI: 10.1613/jair.1379

7Citations

20Readers

Abstract

Value iteration is a popular algorithm for finding near optimal policies for POMDPs. It is inefficient due to the need to account for the entire belief space, which necessitates the solution of large numbers of linear programs. In this paper, we study value iteration restricted to belief subsets. We show that, together with properly chosen belief subsets, restricted value iteration yields near-optimal policies and we give a condition for determining whether a given belief subset would bring about savings in space and time. We also apply restricted value iteration to two interesting classes of POMDPs, namely informative POMDPs and near-discernible POMDPs. © 2005 AI Access Foundation. All rights reserved.

Cite

CITATION STYLE

APA

Zhang, W., & Zhang, N. L. (2005). Restricted value iteration: Theory and algorithms. Journal of Artificial Intelligence Research, 23, 123–165. https://doi.org/10.1613/jair.1379

Restricted value iteration: Theory and algorithms

Abstract

Cite

Register to see more suggestions