A tutorial on partially observable Markov decision processes

Michael L. Littman

Journal Article

Journal of Mathematical Psychology (2009) 53(3) 119-125

DOI: 10.1016/j.jmp.2009.01.005

95 Citations

301 Readers

Abstract

The partially observable Markov decision process (POMDP) model of environments was first explored in the engineering and operations research communities 40 years ago. More recently, the model has been embraced by researchers in artificial intelligence and machine learning, leading to a flurry of solution algorithms that can identify optimal or near-optimal behavior in many environments represented as POMDPs. The purpose of this article is to introduce the POMDP model to behavioral scientists who may wish to apply the framework to the problem of understanding normative behavior in experimental settings. The article includes concrete examples using a publicly available POMDP solution package. © 2009 Elsevier Inc. All rights reserved.

Littman, M. L. (2009). A tutorial on partially observable Markov decision processes. Journal of Mathematical Psychology, 53(3), 119–125. https://doi.org/10.1016/j.jmp.2009.01.005
