Class Imbalance in Software Fault Prediction Data Set

C. Arun; C. Lakshmi

Conference Proceedings

Class Imbalance in Software Fault Prediction Data Set

Advances in Intelligent Systems and Computing (2020) 1056 745-757

DOI: 10.1007/978-981-15-0199-9_64

9Citations

11Readers

Get full text

Abstract

Classification has been the prominent technique in machine learning domain, due to its ability of forecasting and predicts capabilities it is widely used in various domains such as health care, networking, social network, and software engineering with enhancement of different algorithm. The performance of the classifier majorly depends on the quality and amount of data present in the training sample. In real-world scenario, the majority of training samples suffered from class imbalance problem, that is, most of the data samples belong to one particular category, i.e., majority class while very few represent the minority class. In this case, classification techniques tend to be overwhelmed by the majority class and ignore the minority class. To solve class imbalance problem people relay on the different kind of sampling techniques either by generating synthetic data or by concentrating on minority class samples, but those approaches have introduced adverse effect in the learnability. In this paper, we attempt to study different techniques proposed to solve the class imbalance problem.

Author supplied keywords

Cite

CITATION STYLE

APA

Arun, C., & Lakshmi, C. (2020). Class Imbalance in Software Fault Prediction Data Set. In Advances in Intelligent Systems and Computing (Vol. 1056, pp. 745–757). Springer. https://doi.org/10.1007/978-981-15-0199-9_64

Class Imbalance in Software Fault Prediction Data Set

Abstract

Author supplied keywords

Cite

Register to see more suggestions