A Classification Method Based on Feature Selection for Imbalanced Data

Yi Liu; Yanzhen Wang; Xiaoguang Ren; Hao Zhou; Xingchun Diao

Journal ArticleOPEN ACCESS

A Classification Method Based on Feature Selection for Imbalanced Data

IEEE Access (2019) 7 81794-81807

DOI: 10.1109/ACCESS.2019.2923846

58Citations

67Readers

Abstract

Imbalanced data are very common in the real world, and it may deteriorate the performance of the conventional classification algorithms. In order to resolve the imbalanced classification problems, we propose an ensemble classification method that combines evolutionary under-sampling and feature selection. We employ the Bootstrap method in original data to generate many sample subsets. V-statistic is developed to measure the distribution of imbalanced data, and it is also taken as the optimization objective of the genetic algorithm for the under-sampling sample subsets. Moreover, we take F1 and Gmean indicators as two optimization objectives and employ the multiobjective ant colony optimization algorithm for feature selection of resampled data to construct an ensemble system. Ten low-dimensional and four high-dimensional typical imbalanced datasets are used in experiments. The six state-of-the-art algorithms and four measures are taken for a fair comparison. The experimental results show that our proposed system has a better classification performance compared with other algorithms, especially for the high-dimensional imbalanced data.

Author supplied keywords

Cite

CITATION STYLE

APA

Liu, Y., Wang, Y., Ren, X., Zhou, H., & Diao, X. (2019). A Classification Method Based on Feature Selection for Imbalanced Data. IEEE Access, 7, 81794–81807. https://doi.org/10.1109/ACCESS.2019.2923846

A Classification Method Based on Feature Selection for Imbalanced Data

Abstract

Author supplied keywords

Cite

Register to see more suggestions