Bayesian learning of other agents' finite controllers for interactive POMDPs

Alessandro Panella; Piotr Gmytrasiewicz

Conference Proceedings (Open Access)

30th AAAI Conference on Artificial Intelligence, AAAI 2016 (2016) 2530-2536

DOI: 10.1609/aaai.v30i1.10136

Abstract

We consider an autonomous agent operating in a stochastic, partially observable, multiagent environment that explicitly models the other agents as probabilistic deterministic finite-state controllers (PDFCs) in order to predict their actions. We assume that such models are not given to the agent but must instead be learned from (possibly imperfect) observations of the other agents' behavior. The agent maintains a belief over the other agents' models, which is updated via Bayesian inference. To represent this belief, we place a flexible stick-breaking distribution over PDFCs, which allows the posterior to concentrate around controllers whose size is not bounded and scales with the complexity of the observed data. Since this Bayesian inference task is not analytically tractable, we devise a Markov chain Monte Carlo algorithm to approximate the posterior distribution. The agent then embeds the result of this inference into its own decision-making process using the interactive POMDP framework. We show that our learning algorithm can learn agent models that are behaviorally accurate for problems of varying complexity, and that the agent's performance increases as a result.
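To give a sense of how a stick-breaking prior leaves the controller size unbounded, the following is a minimal sketch of the standard stick-breaking (GEM) construction of mixture weights. It is not taken from the paper and is not the authors' prior over PDFCs; the function name `stick_breaking_weights`, the concentration parameter `alpha`, and the truncation tolerance `tol` are all assumptions made for illustration.

```python
import random


def stick_breaking_weights(alpha, tol=1e-6, rng=None):
    """Sample stick-breaking weights, truncated once the leftover stick is <= tol.

    Hypothetical helper for illustration only; `alpha` and `tol` are assumed
    parameters, not values from the paper.
    """
    rng = rng or random.Random()
    weights = []
    remaining = 1.0  # length of the stick that has not been assigned yet
    while remaining > tol:
        # Break off a Beta(1, alpha) fraction of what is left.
        fraction = rng.betavariate(1.0, alpha)
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    return weights, remaining


# The number of weights is not fixed in advance; it depends on the draws.
weights, leftover = stick_breaking_weights(alpha=2.0, rng=random.Random(0))
print(len(weights), sum(weights) + leftover)
```

In this sketch each weight could be read as the prior mass on one more controller node. A larger `alpha` tends to spread mass over more components, which is the sense in which such a prior does not cap the model size ahead of time.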

Cite

Panella, A., & Gmytrasiewicz, P. (2016). Bayesian learning of other agents’ finite controllers for interactive POMDPs. In 30th AAAI Conference on Artificial Intelligence, AAAI 2016 (pp. 2530–2536). AAAI Press. https://doi.org/10.1609/aaai.v30i1.10136
