An empirical study of learning from imbalanced data using random forest

368Citations
Citations of this article
264Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper discusses a comprehensive suite of experiments that analyze the performance of the random forest (RF) learner implemented in Weka. RF is a relatively new learner, and to the best of our knowledge, only preliminary experimentation on the construction of random forest classifiers in the context of imbalanced data has been reported in previous work. Therefore, the contribution of this study is to provide an extensive empirical evaluation of RF learners built from imbalanced data. What should be the recommended default number of trees in the ensemble? What should the recommended value be for the number of attributes? How does the RF learner perform on imbalanced data when compared with other commonly-used learners? We address these and other related issues in this work. © 2007 IEEE.

Cite

CITATION STYLE

APA

Khoshgoftaar, T. M., Golawala, M., & Van Hulse, J. (2007). An empirical study of learning from imbalanced data using random forest. In Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI (Vol. 2, pp. 310–317). https://doi.org/10.1109/ICTAI.2007.46

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free