Poor quality data may be detected and corrected by performing various quality assurance activities that rely on techniques with different efficacy and cost. In this paper, we propose a quantitative approach for measuring and comparing the effectiveness of these data quality (DQ) techniques. Our definitions of effectiveness are inspired by measures proposed in Information Retrieval. We show how the effectiveness of a DQ technique can be mathematically estimated in general cases, using formal techniques that are based on probabilistic assumptions. We then show how the resulting effectiveness formulas can be used to evaluate, compare and make choices involving DQ techniques. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Jiang, L., Barone, D., Borgida, A., & Mylopoulos, J. (2009). Measuring and comparing effectiveness of data quality techniques. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5565 LNCS, pp. 171–185). https://doi.org/10.1007/978-3-642-02144-2_17
Mendeley helps you to discover research relevant for your work.