Bayesian Inference for Least Squares Temporal Difference Regularization

Nikolaos Tziortziotis; Christos Dimitrakakis

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017), Vol. 10535 LNAI, pp. 126–141

DOI: 10.1007/978-3-319-71246-8_8

Abstract

This paper proposes a fully Bayesian approach to Least-Squares Temporal Differences (LSTD). The resulting probabilistic inference of value functions avoids the overfitting that classical LSTD commonly suffers when the number of features exceeds the number of samples. Sparse Bayesian learning provides an elegant solution by introducing a prior over the value function parameters. This yields probabilistic predictions, a sparse model, and good generalisation, as irrelevant parameters are marginalised out. The algorithm efficiently approximates the posterior distribution through variational inference. Experiments demonstrate that the algorithm avoids overfitting.
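To make the overfitting setting concrete, the following is a minimal NumPy sketch of classical LSTD on a batch with more features than samples. It is an illustration under stated assumptions, not the algorithm proposed in the paper: the features, rewards, and discount factor are random placeholders, the function names `lstd_matrices` and `lstd_solve` are hypothetical, and the optional ridge term is a simple stand-in regulariser rather than the variational sparse Bayesian prior described above.

```python
import numpy as np

def lstd_matrices(phi, phi_next, rewards, gamma):
    """Build the classical LSTD system A theta = b from a batch of transitions."""
    A = phi.T @ (phi - gamma * phi_next)   # shape (k, k), rank at most n_samples
    b = phi.T @ rewards                    # shape (k,)
    return A, b

def lstd_solve(A, b, ridge=0.0):
    """Solve (A + ridge * I) theta = b; lstsq tolerates a singular system."""
    k = A.shape[0]
    theta, *_ = np.linalg.lstsq(A + ridge * np.eye(k), b, rcond=None)
    return theta

# Assumed toy data: 5 transitions described by 20 random features each.
rng = np.random.default_rng(0)
n_samples, n_features, gamma = 5, 20, 0.9
phi = rng.normal(size=(n_samples, n_features))       # features of s_t
phi_next = rng.normal(size=(n_samples, n_features))  # features of s_{t+1}
rewards = rng.normal(size=n_samples)

A, b = lstd_matrices(phi, phi_next, rewards, gamma)
rank_A = np.linalg.matrix_rank(A)

# Unregularised solution (one of infinitely many, since A is singular)
# and an l2-regularised solution used here only as a placeholder.
theta = lstd_solve(A, b)
theta_ridge = lstd_solve(A, b, ridge=1.0)

print("rank of A:", rank_A, "out of", n_features)
```

Because A is a product of a 20×5 and a 5×20 matrix, its rank cannot exceed the number of samples, so the unregularised system has no unique solution. This is the regime in which a prior over the parameters, as in the abstract, becomes useful.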

Cite (APA)

Tziortziotis, N., & Dimitrakakis, C. (2017). Bayesian Inference for Least Squares Temporal Difference Regularization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10535 LNAI, pp. 126–141). Springer Verlag. https://doi.org/10.1007/978-3-319-71246-8_8
