Solving risk-sensitive POMDPs with and without cost observations

13Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

Partially Observable Markov Decision Processes (POMDPs) are often used to model planning problems under uncertainty. The goal in Risk-Sensitive POMDPs (RS-POMDPs) is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, unlike existing POMDP literature, we distinguish between the two cases of whether costs can or cannot be observed and show the empirical impact of cost observations. We also introduce a new search-based algorithm to solve RS-POMDPs and show that it is faster and more scalable than existing approaches in two synthetic domains and a taxi domain generated with real-world data.

Cite

CITATION STYLE

APA

Hou, P., Yeoh, W., & Varakantham, P. (2016). Solving risk-sensitive POMDPs with and without cost observations. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 3138–3144). AAAI press. https://doi.org/10.1609/aaai.v30i1.10402

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free