An online kernel-based clustering approach for value function approximation

Nikolaos Tziortziotis; Konstantinos Blekas

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 7297 LNCS, pp. 182–189 (2012)

DOI: 10.1007/978-3-642-30448-4_23

Abstract

Value function approximation is a critical task in solving Markov decision processes and in accurately modeling reinforcement learning agents. A significant issue is how to construct efficient feature spaces from samples collected from the environment in order to obtain an optimal policy. This study addresses the challenge by proposing an online kernel-based clustering approach that builds appropriate basis functions during the learning process. The method uses a kernel function capable of handling state-action pairs as they are sequentially generated by the agent. At each time step, the procedure either adds a new cluster or adjusts the parameters of the winning cluster. The value function is modeled as a linear combination of the constructed basis functions, whose weights are optimized in a temporal-difference framework to minimize the Bellman approximation error. The proposed method is evaluated in numerous well-known simulated environments. © 2012 Springer-Verlag.
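
The loop described in the abstract (add a cluster or adjust the winner, then update linear weights by temporal differences) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the kernel, the rule for creating clusters, or the weight optimizer, so the Gaussian kernel, the `novelty_threshold` test, the running-mean center update, and the plain TD(0) step below are all assumptions chosen for illustration. State-action pairs are assumed to be encoded as numeric vectors, and the class and parameter names are hypothetical.

```python
import math


class OnlineKernelClusters:
    """Illustrative online clustering of state-action vectors.

    Each cluster center defines one Gaussian basis function, and the value
    estimate is a linear combination of those basis functions.
    """

    def __init__(self, novelty_threshold=0.5, bandwidth=1.0):
        # Assumption: a sample starts a new cluster when no existing
        # basis function responds to it with at least this activation.
        self.novelty_threshold = novelty_threshold
        self.bandwidth = bandwidth  # assumed shared kernel width
        self.centers = []  # one center per cluster
        self.counts = []   # number of samples assigned to each cluster
        self.weights = []  # one linear weight per basis function

    def kernel(self, x, center):
        # Gaussian kernel on the encoded state-action vector (assumption).
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, center))
        return math.exp(-sq_dist / (2.0 * self.bandwidth ** 2))

    def features(self, x):
        return [self.kernel(x, c) for c in self.centers]

    def value(self, x):
        return sum(w * f for w, f in zip(self.weights, self.features(x)))

    def observe(self, x):
        """Add a new cluster for x, or move the winning cluster toward x."""
        phi = self.features(x)
        if not phi or max(phi) < self.novelty_threshold:
            self.centers.append(list(x))
            self.counts.append(1)
            self.weights.append(0.0)
            return len(self.centers) - 1
        winner = max(range(len(phi)), key=phi.__getitem__)
        self.counts[winner] += 1
        rate = 1.0 / self.counts[winner]  # running-mean step size
        self.centers[winner] = [
            c + rate * (a - c) for a, c in zip(x, self.centers[winner])
        ]
        return winner

    def td_update(self, x, reward, x_next, gamma=0.9, alpha=0.1, terminal=False):
        """Cluster x, then apply one TD(0) step to the linear weights."""
        self.observe(x)
        phi = self.features(x)
        bootstrap = 0.0 if terminal else gamma * self.value(x_next)
        delta = reward + bootstrap - self.value(x)  # temporal-difference error
        self.weights = [w + alpha * delta * f for w, f in zip(self.weights, phi)]
        return delta


# Usage: feed encoded state-action vectors as the agent generates them.
model = OnlineKernelClusters(novelty_threshold=0.5, bandwidth=1.0)
model.observe([0.0, 0.0])  # first sample always starts a cluster
model.observe([5.0, 0.0])  # far from the first center: new cluster
model.observe([0.2, 0.0])  # close to the first center: adjusts it
```

In this sketch the number of basis functions grows only when a sample is poorly covered by the existing clusters, so the feature space is built during learning rather than fixed in advance. A practical implementation would likely also adapt per-cluster widths and could replace the TD(0) step with a least-squares variant.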

Citation (APA)

Tziortziotis, N., & Blekas, K. (2012). An online kernel-based clustering approach for value function approximation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7297 LNCS, pp. 182–189). https://doi.org/10.1007/978-3-642-30448-4_23
