Embedding Monte Carlo search of features in tree-based ensemble methods

Francis Maes; Pierre Geurts; Louis Wehenkel

Conference ProceedingsOPEN ACCESS

Embedding Monte Carlo search of features in tree-based ensemble methods

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7523 LNAI(PART 1) 191-206

DOI: 10.1007/978-3-642-33460-3_18

2Citations

16Readers

Abstract

Feature generation is the problem of automatically constructing good features for a given target learning problem. While most feature generation algorithms belong either to the filter or to the wrapper approach, this paper focuses on embedded feature generation. We propose a general scheme to embed feature generation in a wide range of tree-based learning algorithms, including single decision trees, random forests and tree boosting. It is based on the formalization of feature construction as a sequential decision making problem addressed by a tractable Monte Carlo search algorithm coupled with node splitting. This leads to fast algorithms that are applicable to large-scale problems. We empirically analyze the performances of these tree-based learners combined or not with the feature generation capability on several standard datasets. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Maes, F., Geurts, P., & Wehenkel, L. (2012). Embedding Monte Carlo search of features in tree-based ensemble methods. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7523 LNAI, pp. 191–206). https://doi.org/10.1007/978-3-642-33460-3_18

Embedding Monte Carlo search of features in tree-based ensemble methods

Abstract

Author supplied keywords

Cite

Register to see more suggestions