Classification complexity measures play an important role in classifier selection and are primarily designed for balanced data. Focusing on binary classification, this paper proposes a novel methodology to evaluate their validity on imbalanced data. The twelve complexity measures composed by Ho are evaluated on synthetic imbalanced data sets with various probability distributions, various boundary shapes and various data skewness. The experimental results demonstrate that most of the complexity measures are statistically changeable as data skewness varies. They need to be revised and improved for imbalanced data. © 2013 Springer-Verlag.
CITATION STYLE
Xing, Y., Cai, H., Cai, Y., Hejlesen, O., & Toft, E. (2013). Preliminary evaluation of classification complexity measures on imbalanced data. In Lecture Notes in Electrical Engineering (Vol. 256 LNEE, pp. 189–196). https://doi.org/10.1007/978-3-642-38466-0_22
Mendeley helps you to discover research relevant for your work.