Imbalanced class distribution is a challenging problem in many real-life classification problems. Existing synthetic oversampling do suffer from the curse of dimensionality because they rely heavily on Euclidean distance. This paper proposed a new method, called Minority Oversampling Technique based on Local Densities in Low-Dimensional Space (or MOT2LD in short). MOT2LD first maps each training sample into a low-dimensional space, and makes clustering of their low-dimensional representations. It then assigns weight to each minority sample as the product of two quantities: local minority density and local majority count, indicating its importance of sampling. The synthetic minority class samples are generated inside some minority cluster. MOT2LD has been evaluated on 15 real-world data sets. The experimental results have shown that our method outperforms some other existing methods including SMOTE, Borderline- SMOTE, ADASYN, and MWMOTE, in terms of G-mean and F-measure.
CITATION STYLE
Xie, Z., Jiang, L., Ye, T., & Li, X. L. (2015). A synthetic minority oversampling method based on local densities in low-dimensional space for imbalanced learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9050, pp. 3–18). Springer Verlag. https://doi.org/10.1007/978-3-319-18123-3_1
Mendeley helps you to discover research relevant for your work.