Convolutional Neural Networks-Based Locating Relevant Buggy Code Files for Bug Reports Affected by Data Imbalance

Guangliang Liu; Yang Lu; Ke Shi; Jingfei Chang; Xing Wei

Journal ArticleOPEN ACCESS

Convolutional Neural Networks-Based Locating Relevant Buggy Code Files for Bug Reports Affected by Data Imbalance

IEEE Access (2019) 7 131304-131316

DOI: 10.1109/ACCESS.2019.2940557

10Citations

17Readers

Abstract

Software bug localization is very important in software engineering, but it is also complicated and time consuming. To improve the efficiency of developers, researchers have developed various traditional bug localization and machine learning bug localization methods. In this paper, we propose a novel method that improves bug localization performance. First, surface lexical correlation matching between bug reports and source code files is used to obtain features by deep neural network. Second, to solve the lexical gap between bug reports and source code files, semantic correlation matching between them is used to obtain features based on word embedding and sentence embedding by deep neural network. Then, the joint features obtained by the surface lexical and semantic correlation matching are fused into a unified feature representation for bug reports and source code files. In addition, since our experimental datasets are imbalanced data, we use a focal loss function to solve the impact of data imbalance. Finally, our method obtains the relatively high bug localization performance compared to other classic methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Liu, G., Lu, Y., Shi, K., Chang, J., & Wei, X. (2019). Convolutional Neural Networks-Based Locating Relevant Buggy Code Files for Bug Reports Affected by Data Imbalance. IEEE Access, 7, 131304–131316. https://doi.org/10.1109/ACCESS.2019.2940557

Convolutional Neural Networks-Based Locating Relevant Buggy Code Files for Bug Reports Affected by Data Imbalance

Abstract

Author supplied keywords

Cite

Register to see more suggestions