A synthetic minority oversampling method based on local densities in low-dimensional space for imbalanced learning

24Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Imbalanced class distribution is a challenging problem in many real-life classification problems. Existing synthetic oversampling do suffer from the curse of dimensionality because they rely heavily on Euclidean distance. This paper proposed a new method, called Minority Oversampling Technique based on Local Densities in Low-Dimensional Space (or MOT2LD in short). MOT2LD first maps each training sample into a low-dimensional space, and makes clustering of their low-dimensional representations. It then assigns weight to each minority sample as the product of two quantities: local minority density and local majority count, indicating its importance of sampling. The synthetic minority class samples are generated inside some minority cluster. MOT2LD has been evaluated on 15 real-world data sets. The experimental results have shown that our method outperforms some other existing methods including SMOTE, Borderline- SMOTE, ADASYN, and MWMOTE, in terms of G-mean and F-measure.

Cite

CITATION STYLE

APA

Xie, Z., Jiang, L., Ye, T., & Li, X. L. (2015). A synthetic minority oversampling method based on local densities in low-dimensional space for imbalanced learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9050, pp. 3–18). Springer Verlag. https://doi.org/10.1007/978-3-319-18123-3_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free