Abstract
Probabilistic databases address well the requirements of an increasing number of modern applications that produce large volumes of uncertain data from a variety of sources. We propose probabilistic keys as a principled tool helping organizations balance the consistency and completeness targets for their data quality. For this purpose, algorithms are established for an agile schema- and data-driven acquisition of the marginal probability by which keys should hold in a given application domain, and for reasoning about these keys. The efficiency of our acquisition framework is demonstrated theoretically and experimentally.
Author supplied keywords
Cite
CITATION STYLE
Brown, P., & Link, S. (2015). Probabilistic keys for data quality management. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9097, pp. 118–132). Springer Verlag. https://doi.org/10.1007/978-3-319-19069-3_8
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.