Abstract
Prediction the number of faults in software modules can be more helpful instead of predicting the modules being faulty or non-faulty. Some regression models have been used for predicting the number of faults. However, the software defect data may involve irrelevant and redundant module features, which will degrade the performance of these regression models. To address such issue, this paper proposes a feature selection method based on Feature Spectral Clustering and feature Ranking (FSCR) for the number of software faults prediction. First, FSCR groups the original features with spectral clustering according to the correlation between every two features. Second, FSCR employs ReliefF algorithm to compute the relevance between each feature with respect to the number of faults and selects top p most relevant features from each resulted cluster. We evaluate our proposed method on 6 widely-studied project datasets with four performance metrics. Comparison with five existing feature selection methods demonstrates that FSCR is effective in selecting features for the number of faults prediction.
Author supplied keywords
Cite
CITATION STYLE
Yu, X., Ma, Z., Ma, C., Gu, Y., Liu, R., & Zhang, Y. (2017). FSCR:A Feature selection method for software defect prediction. In Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE (pp. 351–356). Knowledge Systems Institute Graduate School. https://doi.org/10.18293/SEKE2017-081
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.