Data glitches: Monsters in your data

12Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data types and data structures are becoming increasingly complex as they keep pace with evolving technologies and applications. The result is an increase in the number and complexity of data quality problems. Data glitches, a common name for data quality problems, can be simple and stand alone, or highly complex with spatial and temporal correlations. In this chapter, we provide an overview of a comprehensive and measurable data quality process. To begin, we define and classify complex glitch types, and describe detection and cleaning techniques.We present metrics for assessing data quality and for choosing cleaning strategies subject to a variety of considerations. The process culminates in a “clean” data set that is acceptable to the end user.We conclude with an overview of significant literature in this area, and a discussion of opportunities for practice, application, and further research.

Cite

CITATION STYLE

APA

Dasu, T. (2013). Data glitches: Monsters in your data. In Handbook of Data Quality: Research and Practice (pp. 163–178). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-36257-6_8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free