In the process of software defect prediction dataset construction, there are problems such as high labeling costs. Active learning can reduce labeling costs when using uncertainty sampling. Samples with the most uncertainty will be labeled, but samples with the highest certainty will always be discarded. According to cognitive theory, easy samples can promote the performance of the model. Therefore, a hybrid active learning query strategy is proposed. For the sample with lowest information entropy, query-by-committee will analyze it again using vote entropy. Empirical studies show that the proposed HIVE approach outperforms several state-of-the-art active learning approaches.
CITATION STYLE
Qu, Y., Chen, X., Chen, R., Ju, X., & Guo, J. (2019). Active learning using uncertainty sampling and query-by-committee for software defect prediction. International Journal of Performability Engineering, 15(10), 2701–2708. https://doi.org/10.23940/ijpe.19.10.p16.27012708
Mendeley helps you to discover research relevant for your work.