Data Mining and Knowledge Discovery Handbook

N/ACitations
Citations of this article
830Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This chapter analyzes the problem of data cleansing and the identification of potential errors in data sets. The differing views of data cleansing are surveyed and reviewed and a brief overview of existing data cleansing tools is given. A general framework of the data cleansing process is presented as well as a set of general methods that can be used to address the problem. The applicable methods include statistical outlier detection, pattern matching, clustering, and Data Mining techniques. The experimental results of applying these methods to a real world data set are also given. Finally, research directions necessary to further address the data cleansing problem are discussed.

Cite

CITATION STYLE

APA

Data Mining and Knowledge Discovery Handbook. (2010). Data Mining and Knowledge Discovery Handbook. Springer US. https://doi.org/10.1007/978-0-387-09823-4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free