Data glitches: Monsters in your data

Tamraparni Dasu

Book Chapter

Data glitches: Monsters in your data

Dasu T

Springer Berlin Heidelberg, (2013), 163-178

DOI: 10.1007/978-3-642-36257-6_8

12Citations

13Readers

Get full text

Abstract

Data types and data structures are becoming increasingly complex as they keep pace with evolving technologies and applications. The result is an increase in the number and complexity of data quality problems. Data glitches, a common name for data quality problems, can be simple and stand alone, or highly complex with spatial and temporal correlations. In this chapter, we provide an overview of a comprehensive and measurable data quality process. To begin, we define and classify complex glitch types, and describe detection and cleaning techniques.We present metrics for assessing data quality and for choosing cleaning strategies subject to a variety of considerations. The process culminates in a “clean” data set that is acceptable to the end user.We conclude with an overview of significant literature in this area, and a discussion of opportunities for practice, application, and further research.

Cite

CITATION STYLE

APA

Dasu, T. (2013). Data glitches: Monsters in your data. In Handbook of Data Quality: Research and Practice (pp. 163–178). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-36257-6_8

Data glitches: Monsters in your data

Abstract

Cite

Register to see more suggestions