Experiments in value function approximation with sparse support vector regression

Tobias Jung; Thomas Uthmann

Conference ProceedingsOPEN ACCESS

Experiments in value function approximation with sparse support vector regression

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2004) 3201 180-191

DOI: 10.1007/978-3-540-30115-8_19

4Citations

8Readers

Abstract

We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of SVR two ideas are employed. The first is sparse greedy approximation: the data is projected onto the subspace spanned by only a small subset of the original data (in feature space). This subset can be built up in an on-line fashion. Second, we use the sparsified data to solve a reduced quadratic problem, where the number of variables is independent of the total number of training samples seen. The feasability of this approach is demonstrated on two common toy-problems. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Jung, T., & Uthmann, T. (2004). Experiments in value function approximation with sparse support vector regression. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3201, pp. 180–191). Springer Verlag. https://doi.org/10.1007/978-3-540-30115-8_19

Experiments in value function approximation with sparse support vector regression

Abstract

Cite

Register to see more suggestions