Evaluating the effectiveness of data quality framework in software engineering

3Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

The quality of data is important in research working with data sets because poor data quality may lead to invalid results. Data sets contain measurements that are associated with metrics and entities; however, in some data sets, it is not always clear which entities have been measured and exactly which metrics have been used. This means that measurements could be misinterpreted. In this study, we develop a framework for data quality assessment that determines whether a data set has sufficient information to support the correct interpretation of data for analysis in empirical research. The framework incorporates a dataset metamodel and a quality assessment process to evaluate the data set quality. To evaluate the effectiveness of our framework, we conducted a user study. We used observations, a questionnaire and think aloud approach to provide insights into the framework through participant thought processes while applying the framework. The results of our study provide evidence that most participants successfully applied the definitions of dataset category elements and the formal definitions of data quality issues to the datasets. Further work is needed to reproduce our results with more participants, and to determine whether the data quality framework is generalizable to other types of data sets.

Cite

CITATION STYLE

APA

Rosli, M. M., & Yusop, N. S. M. (2022). Evaluating the effectiveness of data quality framework in software engineering. International Journal of Electrical and Computer Engineering, 12(6), 6410–6422. https://doi.org/10.11591/ijece.v12i6.pp6410-6422

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free