Predicting build outcomes in continuous integration using textual analysis of source code commits

Khaled Al-Sabbagh; Miroslaw Staron; Regina Hebig

Conference ProceedingsOPEN ACCESS

Predicting build outcomes in continuous integration using textual analysis of source code commits

PROMISE 2022 - Proceedings of the 18th International Conference on Predictive Models and Data Analytics in Software Engineering, co-located with ESEC/FSE 2022 (2022) 42-51

DOI: 10.1145/3558489.3559070

2Citations

9Readers

Abstract

Machine learning has been increasingly used to solve various software engineering tasks. One example of its usage is to predict the outcome of builds in continuous integration, where a classifier is built to predict whether new code commits will successfully compile. The aim of this study is to investigate the effectiveness of fifteen software metrics in building a classifier for build outcome prediction. Particularly, we implemented an experiment wherein we compared the effectiveness of a line-level metric and fourteen other traditional software metrics on 49,040 build records that belong to 117 Java projects. We achieved an average precision of 91% and recall of 80% when using the line-level metric for training, compared to 90% precision and 76% recall for the next best traditional software metric. In contrast, using file-level metrics was found to yield a higher predictive quality (average MCC for the best software metric= 68%) than the line-level metric (average MCC= 16%) for the failed builds. We conclude that file-level metrics are better predictors of build outcomes for the failed builds, whereas the line-level metric is a slightly better predictor of passed builds.

Author supplied keywords

Cite

CITATION STYLE

APA

Al-Sabbagh, K., Staron, M., & Hebig, R. (2022). Predicting build outcomes in continuous integration using textual analysis of source code commits. In PROMISE 2022 - Proceedings of the 18th International Conference on Predictive Models and Data Analytics in Software Engineering, co-located with ESEC/FSE 2022 (pp. 42–51). Association for Computing Machinery, Inc. https://doi.org/10.1145/3558489.3559070

Predicting build outcomes in continuous integration using textual analysis of source code commits

Abstract

Author supplied keywords

Cite

Register to see more suggestions