An empirical study of the behavior of classifiers on imbalanced and overlapped data sets

Vicente García; Jose Sánchez; Ramon Mollineda

Conference ProceedingsOPEN ACCESS

An empirical study of the behavior of classifiers on imbalanced and overlapped data sets

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4756 LNCS 397-406

DOI: 10.1007/978-3-540-76725-1_42

99Citations

48Readers

Abstract

Class imbalance has been reported as an important obstacle to apply traditional learning algorithms to real-world domains. Recent investigations have questioned whether the imbalance is the unique factor that hinders the performance of classifiers. In this paper, we study the behavior of six algorithms when classifying imbalanced, overlapped data sets under uncommon situations (e.g., when the overall imbalance ratio is different from the local imbalance ratio in the overlap region). This is accomplished by analyzing the accuracy on each individual class, thus devising how those situations affect the majority and minority classes. The experiments corroborate that overlap is more important than imbalance for the classification performance. Also, they show that the classifiers behave differently depending on the nature of each model. © Springer-Verlag Berlin Heidelberg 2007.

Author supplied keywords

Cite

CITATION STYLE

APA

García, V., Sánchez, J., & Mollineda, R. (2007). An empirical study of the behavior of classifiers on imbalanced and overlapped data sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4756 LNCS, pp. 397–406). https://doi.org/10.1007/978-3-540-76725-1_42

An empirical study of the behavior of classifiers on imbalanced and overlapped data sets

Abstract

Author supplied keywords

Cite

Register to see more suggestions