Metric-Based Fault Prediction for Spreadsheets

Patrick Koch; Konstantin Schekotihin; Dietmar Jannach; Birgit Hofer; Franz Wotawa

Journal ArticleOPEN ACCESS

Metric-Based Fault Prediction for Spreadsheets

IEEE Transactions on Software Engineering (2021) 47(10) 2195-2207

DOI: 10.1109/TSE.2019.2944604

4Citations

21Readers

Abstract

Electronic spreadsheets are widely used in organizations for various data analytics and decision-making tasks. Even though faults within such spreadsheets are common and can have significant negative consequences, today's tools for creating and handling spreadsheets provide limited support for fault detection, localization, and repair. Being able to predict whether a certain part of a spreadsheet is faulty or not is often central for the implementation of such supporting functionality. In this work, we propose a novel approach to fault prediction in spreadsheet formulas, which combines an extensive catalog of spreadsheet metrics with modern machine learning algorithms. An analysis of the individual metrics from our catalog reveals that they are generally suited to discover a wide range of faults. Their predictive power is, however, limited when considered in isolation. Therefore, in our approach we apply supervised learning algorithms to obtain fault predictors that utilize all data provided by multiple spreadsheet metrics from our catalog. Experiments on different datasets containing faulty spreadsheets show that particularly Random Forests classifiers are often effective. As a result, the proposed method is in many cases able to make highly accurate predictions whether a given formula of a spreadsheet is faulty.11.Results of a preliminary study were published in [1].

Author supplied keywords

Cite

CITATION STYLE

APA

Koch, P., Schekotihin, K., Jannach, D., Hofer, B., & Wotawa, F. (2021). Metric-Based Fault Prediction for Spreadsheets. IEEE Transactions on Software Engineering, 47(10), 2195–2207. https://doi.org/10.1109/TSE.2019.2944604

Metric-Based Fault Prediction for Spreadsheets

Abstract

Author supplied keywords

Cite

Register to see more suggestions