This chapter surveys recent lines of work that use Bayesian techniques for reinforcement learning. In Bayesian learning, uncertainty is expressed by a prior distribution over unknown parameters, and learning is achieved by computing a posterior distribution from the observed data. Hence, Bayesian reinforcement learning distinguishes itself from other forms of reinforcement learning by explicitly maintaining a distribution over quantities such as the model parameters, the value function, the policy, or its gradient. This yields several benefits: a) domain knowledge can be naturally encoded in the prior distribution to speed up learning; b) the exploration/exploitation tradeoff can be naturally optimized; and c) notions of risk can be naturally taken into account to obtain robust policies.
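The prior-to-posterior update and its use for the exploration/exploitation tradeoff can be illustrated with a minimal sketch. The example below is not from the chapter: it assumes a two-armed Bernoulli bandit with illustrative success probabilities, a conjugate Beta(1, 1) prior, and Thompson sampling (acting greedily with respect to a posterior sample) as the exploration strategy.

```python
import random

def thompson_sampling(true_probs, steps=2000, seed=0):
    """Beta-Bernoulli Thompson sampling on a multi-armed bandit.

    `true_probs` (the hidden arm success rates) and the Beta(1, 1)
    prior are illustrative assumptions, not taken from the chapter.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    alpha = [1.0] * n_arms  # Beta posterior parameters: 1 + successes
    beta = [1.0] * n_arms   # and 1 + failures, per arm
    pulls = [0] * n_arms
    for _ in range(steps):
        # Sample a plausible success rate for each arm from its posterior,
        # then act greedily with respect to the samples: uncertain arms
        # occasionally produce high samples, which drives exploration.
        samples = [rng.betavariate(alpha[i], beta[i]) for i in range(n_arms)]
        arm = max(range(n_arms), key=lambda i: samples[i])
        reward = 1 if rng.random() < true_probs[arm] else 0
        # Conjugate posterior update: Beta prior + Bernoulli likelihood
        # yields a Beta posterior, so the update is a simple count.
        alpha[arm] += reward
        beta[arm] += 1 - reward
        pulls[arm] += 1
    return alpha, beta, pulls

alpha, beta, pulls = thompson_sampling([0.3, 0.7])
```

As the posterior over each arm's success rate sharpens, the sampler concentrates its pulls on the better arm, so exploration tapers off automatically rather than by a hand-tuned schedule.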
Citation:
Vlassis, N., Ghavamzadeh, M., Mannor, S., & Poupart, P. (2012). Bayesian reinforcement learning. In Adaptation, Learning, and Optimization (Vol. 12, pp. 359–386). Springer Verlag. https://doi.org/10.1007/978-3-642-27645-3_11