Sets of features in Markov decision processes can play a critical role in approximately representing value and in abstracting the state space. Feature selection is crucial to the success of a system and is most often conducted by a human. We study the problem of automatically selecting problem features, and we propose and evaluate a simple approach that reduces the selection of a new feature to standard classification learning. We learn a classifier that predicts the sign of the Bellman error over a training set of states. By iteratively adding the learned classifiers as new features, retraining with approximate value iteration between iterations, we find a Tetris feature set that significantly outperforms randomly constructed features and obtains a score of about three-tenths of the highest score obtained with a carefully hand-constructed feature set. We also show that features learned with this method outperform those learned with the previous method of Patrascu et al. [4] on the same SysAdmin domain used for evaluation there. © Springer-Verlag Berlin Heidelberg 2005.
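The loop the abstract describes can be sketched as follows. This is a minimal illustrative reconstruction, not the authors' implementation: the chain MDP, the decision-stump classifier, and every function name below are assumptions chosen to keep the example self-contained. The pattern it demonstrates is the one stated in the abstract: run approximate value iteration with the current feature set, train a classifier to predict the sign of the Bellman error over sampled states, and add that classifier's output as a new binary feature.

```python
import numpy as np

# Illustrative setup: a 10-state chain MDP with reward only in the last
# state; the agent may stay or move right. This toy domain and the
# decision-stump "classifier" are assumptions for the sketch.
N, GAMMA = 10, 0.9
REWARD = np.zeros(N)
REWARD[-1] = 1.0

def backup(v):
    """One greedy Bellman backup: max over the two actions (stay, right)."""
    nxt = np.minimum(np.arange(N) + 1, N - 1)
    return REWARD + GAMMA * np.maximum(v, v[nxt])

def avi(features, iters=200):
    """Approximate value iteration: least-squares fit to Bellman backups."""
    w = np.zeros(features.shape[1])
    for _ in range(iters):
        target = backup(features @ w)
        w, *_ = np.linalg.lstsq(features, target, rcond=None)
    return w

def stump(labels):
    """Tiny stand-in classifier: threshold on the state index that best
    predicts whether the Bellman-error sign is positive."""
    best = max(range(N),
               key=lambda t: np.sum((np.arange(N) >= t) == (labels > 0)))
    return (np.arange(N) >= best).astype(float)

# Feature-discovery loop from the abstract: classifier outputs become
# new features, with AVI retraining between iterations.
features = np.ones((N, 1))            # start from a single bias feature
for _ in range(4):                    # discover four new features
    v = features @ avi(features)
    bellman_err = backup(v) - v       # its sign is the training label
    features = np.column_stack([features, stump(np.sign(bellman_err))])

v = features @ avi(features)
print(np.round(v, 2))                 # approximate values per state
```

On this chain, each round's Bellman error is most positive at the frontier of states the current features cannot distinguish, so the stumps discover successively earlier threshold features and the fit sharpens toward the true discounted values near the goal.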
CITATION STYLE
Wu, J. H., & Givan, R. (2005). Feature-discovering approximate value iteration methods. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3607 LNAI, pp. 321–331). Springer Verlag. https://doi.org/10.1007/11527862_25