Computing optimal repairs for functional dependencies

30Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

We investigate the complexity of computing an optimal repair of an inconsistent database, in the case where integrity constraints are Functional Dependencies (FDs).We focus on two types of repairs: an optimal subset repair (optimal S-repair), which is obtained by a minimum number of tuple deletions, and an optimal update repair (optimal U-repair), which is obtained by a minimum number of value (cell) updates. For computing an optimal S-repair, we present a polynomial-time algorithm that succeeds on certain sets of FDs and fails on others. We prove the following about the algorithm. When it succeeds, it can also incorporate weighted tuples and duplicate tuples. When it fails, the problem is NP-hard and, in fact, APX-complete (hence, cannot be approximated better than some constant). Thus, we establish a dichotomy in the complexity of computing an optimal S-repair. We present general analysis techniques for the complexity of computing an optimal Urepair, some based on the dichotomy for S-repairs. We also draw a connection to a past dichotomy in the complexity of finding a most probable database that satisfies a set of FDs with a single attribute on the left-hand side; the case of general FDs was left open, and we show how our dichotomy provides the missing generalization and thereby settles the open problem.

Cite

CITATION STYLE

APA

Livshits, E., Kimelfeld, B., & Roy, S. (2020). Computing optimal repairs for functional dependencies. ACM Transactions on Database Systems, 45(1). https://doi.org/10.1145/3360904

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free