Solving uncertain MDPs by reusing state information and plans

Ping Hou; William Yeoh; Tran Cao Son

Conference ProceedingsOPEN ACCESS

Solving uncertain MDPs by reusing state information and plans

Proceedings of the National Conference on Artificial Intelligence (2014) 3 2285-2292

DOI: 10.1609/aaai.v28i1.9029

1Citations

12Readers

Abstract

While MDPs are powerful tools for modeling sequential decision making problems under uncertainty, they are sensitive to the accuracy of their parameters. MDPs with uncertainty in their parameters are called Uncertain MDPs. In this paper, we introduce a general framework that allows off-theshelf MDP algorithms to solve Uncertain MDPs by planning based on currently available information and replan if and when the problem changes. We demonstrate the generality of this approach by showing that it can use the VI, TVI, ILAO∗, LRTDP, and UCT algorithms to solve Uncertain MDPs. We experimentally show that our approach is typically faster than replanning from scratch and we also provide a way to estimate the amount of speedup based on the amount of information being reused.

Cite

CITATION STYLE

APA

Hou, P., Yeoh, W., & Son, T. C. (2014). Solving uncertain MDPs by reusing state information and plans. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2285–2292). AI Access Foundation. https://doi.org/10.1609/aaai.v28i1.9029

Solving uncertain MDPs by reusing state information and plans

Abstract

Cite

Register to see more suggestions