Automatic rules generation approach for data cleaning in medical applications

1Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data quality is considered crucial challenge in emerging big data scenarios. Data mining techniques can be reutilized efficiently in data cleaning process. Recent studies have shown that databases are often suffered from inconsistent data issues, which ought to be resolved in the cleaning process. In this paper, we introduce an automated approach for dependably generating rules from databases themselves, in order to detect data inconsistency problems from large databases. The proposed approach employs confidence and lift measures with integrity constraints, in order to guarantee that generated rules are minimal, non-redundant and precise. The proposed approach is validated against several datasets from healthcare domain. We experimentally demonstrate that our approach outperform significant enhancement over existing approaches.

Cite

CITATION STYLE

APA

Abdo, A. S., Salem, R. K., & Abdul-Kader, H. M. (2016). Automatic rules generation approach for data cleaning in medical applications. In Advances in Intelligent Systems and Computing (Vol. 407, pp. 3–14). Springer Verlag. https://doi.org/10.1007/978-3-319-26690-9_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free