Unsupervised deep domain adaptation for heterogeneous defect prediction

13Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

Heterogeneous defect prediction (HDP) is to detect the largest number of defective software modules in one project by using historical data collected from other projects with different metrics. However, these data can not be directly used because of different metrics set among projects. Meanwhile, software data have more non-defective instances than defective instances which may cause a significant bias towards defective instances. To completely solve these two restrictions, we propose unsupervised deep domain adaptation approach to build a HDP model. Specifically, we firstly map the data of source and target projects into a unified metric representation (UMR). Then, we design a simple neural network (SNN) model to deal with the heterogeneous and class-imbalanced problems in software defect prediction (SDP). In particular, our model introduces the Maximum Mean Discrepancy (MMD) as the distance between the source and target data to reduce the distribution mismatch, and use the cross-entropy loss function as the classification loss. Extensive experiments on 18 public projects from four datasets indicate that the proposed approach can build an effective prediction model for heterogeneous defect prediction (HDP) and outperforms the related competing approaches.

Cite

CITATION STYLE

APA

Gong, L., Jiang, S., Yu, Q., & Jiang, L. (2019). Unsupervised deep domain adaptation for heterogeneous defect prediction. IEICE Transactions on Information and Systems, E102D(3), 537–549. https://doi.org/10.1587/transinf.2018EDP7289

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free