Contextual Data Cleaning with Ontology FDs

2Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Functional Dependencies (FDs) define attribute relationships based on syntactic equality, and, when used in data cleaning, they erroneously label syntactically different but semantically equivalent values as errors. We motivate the need to include context in data cleaning in order to account for the subjective nature of data quality. We enhance dependency-based data cleaning with Ontology Functional Dependencies (OFDs), which express semantic attribute relationships such as synonyms and is-a hierarchies defined by an ontology. We study the data and ontology repair problem for a set of OFDs, and propose an algorithm that finds the best ontological interpretation of the data that minimizes the number of repairs.

Cite

CITATION STYLE

APA

Zheng, Z. (2021). Contextual Data Cleaning with Ontology FDs. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2911–2913). Association for Computing Machinery. https://doi.org/10.1145/3448016.3450583

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free