Severe class imbalance: Why better algorithms aren't the answer

39Citations
Citations of this article
63Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper argues that severe class imbalance is not just an interesting technical challenge that improved learning algorithms will address, it is much more serious. To be useful, a classifier must appreciably outperform a trivial solution, such as choosing the majority class. Any application that is inherently noisy limits the error rate, and cost, that is achievable. When data are normally distributed, even a Bayes optimal classifier has a vanishingly small reduction in the majority classifier's error rate, and cost, as imbalance increases. For fat tailed distributions, and when practical classifiers are used, often no reduction is achieved. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Drummond, C., & Holte, R. C. (2005). Severe class imbalance: Why better algorithms aren’t the answer. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3720 LNAI, pp. 539–546). https://doi.org/10.1007/11564096_52

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free