We introduce the Action-Constrained Partially Observable Markov Decision Process (AC-POMDP), a model that arose from studying critical robotic applications with damaging actions. AC-POMDPs restrict the optimized policy to apply only feasible actions: each action is feasible in a subset of the state space, and the agent observes the set of actions applicable in the current hidden state, in addition to standard observations. We present optimality equations for AC-POMDPs, which require operating on α-vectors defined over many different belief subspaces. We propose an algorithm named PreCondition Value Iteration (PCVI), which fully exploits this specific structure of the α-vectors. We also design a relaxed version of PCVI whose complexity is exponentially smaller than that of PCVI. Experimental results on POMDP robotic benchmarks with action feasibility constraints demonstrate the benefits of explicitly exploiting the semantic richness of action-feasibility observations in AC-POMDPs over equivalent but unstructured POMDPs.
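The abstract does not give the model's equations, but the role of the action-feasibility observation can be illustrated with a minimal belief-update sketch. The code below is an assumption-laden illustration, not the authors' PCVI algorithm or notation: all names (`T`, `O`, `applicable`, `theta`, etc.) are hypothetical placeholders, and it simply combines a standard POMDP belief update with pruning of states whose applicable-action set differs from the observed one.

```python
# Minimal sketch (NOT the authors' implementation) of an AC-POMDP belief
# update: in addition to the standard observation `obs`, the agent observes
# `theta`, the set of actions applicable in the hidden state, which zeroes
# out incompatible states. All variable names are hypothetical.
import numpy as np

def belief_update(b, a, obs, theta, T, O, applicable):
    """b: belief over n states (1-D array); a: executed action index;
    obs: standard observation index; theta: observed feasible-action set
    (frozenset of action indices); T[a][s, s2]: transition probabilities;
    O[a][s2, o]: observation probabilities; applicable[s2]: the set of
    actions feasible in state s2."""
    n = len(b)
    new_b = np.zeros(n)
    for s2 in range(n):
        # Standard POMDP update term: observation likelihood times
        # predicted occupancy of s2 ...
        new_b[s2] = O[a][s2, obs] * sum(T[a][s, s2] * b[s] for s in range(n))
        # ... plus the AC-POMDP feasibility observation: states whose set
        # of applicable actions differs from the observed one get zero mass.
        if applicable[s2] != theta:
            new_b[s2] = 0.0
    total = new_b.sum()
    return new_b / total if total > 0 else new_b
```

Zeroing out incompatible states is what confines updated beliefs to subspaces of the simplex, which is why, per the abstract, the optimality equations operate on α-vectors defined over different belief subspaces, the structure PCVI exploits.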
CITATION STYLE
Carvalho Chanel, C. P., & Teichteil-Königsbuch, F. (2013). Properly acting under partial observability with action feasibility constraints. In Lecture Notes in Computer Science (Vol. 8188 LNAI, pp. 145–161). Springer. https://doi.org/10.1007/978-3-642-40988-2_10