Data Mining (DM) projects are implemented by following the knowledge discovery process. Several techniques for detecting and handling data quality problems such as missing data, outliers, inconsistent data or time-variant data, can be found in the literature of DM and Data Warehousing (DW). Tasks that are related to the quality of data are mostly in the Data Understanding and in the Data Preparation phases of the DM process. The main limitation in the application of the data quality management techniques is the complexity caused by a lack of anticipation in the detection and resolution of the problems. A DM process model designed for the prior management of data quality is proposed in this work. In this model, the DM process is defined in relation to the Software Engineering (SE) process; the two processes are combined in parallel. The main contribution of this DM process is the anticipation and the automation of all activities necessary to remove data quality problems.
CITATION STYLE
Camara, M. S., Naguingar, D., & Bah, A. (2015). Prior data quality management in data mining process. Lecture Notes in Electrical Engineering, 312, 299–307. https://doi.org/10.1007/978-3-319-06764-3_37
Mendeley helps you to discover research relevant for your work.