Class imbalance learning (CIL) is an important branch of machine learning because classification models generally struggle to learn from imbalanced data, while skewed class distributions arise frequently in real-world applications. In this paper, we introduce a novel CIL solution called the Probability Density Machine (PDM). First, in the context of the Gaussian Naive Bayes (GNB) predictive model, we analyze theoretically why an imbalanced data distribution degrades predictive performance and conclude that the impact of class imbalance is associated only with the prior probability, not with the class-conditional probability of the training data. We then use this analysis to explain the rationale behind several traditional CIL techniques and point out the drawback of combining GNB with them. Next, drawing on the idea of K-nearest neighbors probability density estimation (KNN-PDE), we propose the PDM as an improved GNB-based CIL algorithm. Finally, experiments on a large number of class-imbalanced data sets show that the proposed PDM algorithm achieves promising results.
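To make the abstract's central observation concrete, the Python sketch below (our own illustration, not the paper's reference implementation) scores a test point with a plain Gaussian Naive Bayes rule, where the class-imbalance effect enters only through the log-prior term, and contrasts it with a standard k-nearest-neighbour density estimate of the kind the KNN-PDE idea refers to. All function names, the choice of k, and the decision to compare class-conditional densities without the skewed priors are assumptions made for illustration only.

import numpy as np

def gnb_log_posterior(x, X_train, y_train):
    # Gaussian Naive Bayes log-posterior per class.
    # Class imbalance enters only through log(prior); the per-feature
    # Gaussian likelihoods are fitted within each class and are not
    # affected by the class ratio.
    scores = {}
    for c in np.unique(y_train):
        Xc = X_train[y_train == c]
        prior = len(Xc) / len(X_train)                  # skewed by imbalance
        mu, var = Xc.mean(axis=0), Xc.var(axis=0) + 1e-9
        log_lik = -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)
        scores[c] = np.log(prior) + log_lik
    return scores

def knn_log_density(x, X_train, y_train, k=5):
    # Per-class log-density via a k-nearest-neighbour estimate,
    # p(x | c) ~ k / (n_c * V * r_k**d), where r_k is the distance to the
    # k-th nearest neighbour of x inside class c (the constant unit-ball
    # volume V is dropped since it is shared by all classes).
    d = X_train.shape[1]
    scores = {}
    for c in np.unique(y_train):
        Xc = X_train[y_train == c]
        dist = np.sort(np.linalg.norm(Xc - x, axis=1))
        kk = min(k, len(Xc))
        r_k = dist[kk - 1] + 1e-12
        scores[c] = np.log(kk) - np.log(len(Xc)) - d * np.log(r_k)
    return scores

# With heavily skewed priors, the argmax over gnb_log_posterior tends to
# favour the majority class, whereas the argmax over knn_log_density compares
# class-conditional densities only, removing the prior-driven bias discussed
# in the abstract.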