Optimisation and evaluation of random forests for imbalanced datasets

Julien Thomas; Pierre Emmanuel Jouve; Nicolas Nicoloyannis

Conference Proceedings

Optimisation and evaluation of random forests for imbalanced datasets

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4203 LNAI 622-631

DOI: 10.1007/11875604_69

5Citations

16Readers

Get full text

Abstract

This paper deals with an optimization of Random Forests which aims at: adapting the concept of forest for learning imbalanced data as well as taking into account user's wishes as far as recall and precision rates are concerned. We propose to adapt Random Forest on two levels. First of all, during the forest creation thanks to the use of asymmetric entropy measure associated to specific leaf class assignation rules. Then, during the voting step, by using an alternative strategy to the classical majority voting strategy. The automation of this second step requires a specific methodology for results quality assessment. This methodology allows the user to define his wishes concerning (1) recall and precision rates for each class of the concept to learn, and, (2) the importance he wants to confer to each one of those classes. Finally, results of experimental evaluations are presented. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Thomas, J., Jouve, P. E., & Nicoloyannis, N. (2006). Optimisation and evaluation of random forests for imbalanced datasets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4203 LNAI, pp. 622–631). Springer Verlag. https://doi.org/10.1007/11875604_69

Optimisation and evaluation of random forests for imbalanced datasets

Abstract

Cite

Register to see more suggestions