A formal taxonomy to improve data defect description

João Marcelo Borovina Josko; Marcio Katsumi Oikawa; João Eduardo Ferreira

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9645 307-320

DOI: 10.1007/978-3-319-32055-7_25

6 Citations

15 Readers

Abstract

Data quality assessment outcomes are essential for analytical processes, especially in big data environments. The efficiency and efficacy of such assessment depend on automated solutions, which in turn depend on understanding the problem associated with each data defect. Despite the considerable number of works that describe data defects regarding accuracy, completeness and consistency, there is significant heterogeneity in terminology, nomenclature, description depth and number of examined defects. To fill this gap, this work reports a taxonomy that organizes data defects according to a three-step methodology. The proposed taxonomy enhances the description and coverage of defects relative to related works, and also supports certain requirements of data quality assessment, including the design of semi-supervised solutions for data defect detection.

Cite

Borovina Josko, J. M., Oikawa, M. K., & Ferreira, J. E. (2016). A formal taxonomy to improve data defect description. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9645, pp. 307–320). Springer Verlag. https://doi.org/10.1007/978-3-319-32055-7_25
