Detecting maximum inclusion dependencies without candidate generation

Nuhad Shaabani; Christoph Meinel

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9828 LNCS 118-133

DOI: 10.1007/978-3-319-44406-2_10

Abstract

Inclusion dependencies (INDs) within and across databases are an important relationship for many applications in data integration, schema (re-)design, integrity checking, or query optimization. Existing techniques for detecting all INDs need to generate IND candidates and test their validity in the given data instance. However, the major disadvantage of this approach is the exponentially growing number of data accesses in terms of the number of SQL queries as well as I/O operations. We introduce Mind2, a new approach for detecting n-ary INDs (n > 1) without any candidate generation. Mind2 implements a new characterization of the maximum INDs we developed in this paper. This characterization is based on set operations defined on certain metadata that Mind2 generates by accessing the database only 2 × the number of valid unary INDs. Thus, Mind2 eliminates the exponential number of data accesses needed by existing approaches. Furthermore, the experiments show that Mind2 is significantly more scalable than hypergraph-based approaches.
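For readers unfamiliar with the term, the validity test that candidate-based approaches repeat for each IND candidate can be sketched as follows. This is a minimal illustration of the IND definition on in-memory rows, not the Mind2 algorithm, whose characterization is not given in the abstract. The function name `ind_holds`, the sample tables, and the column names are hypothetical, and NULL handling is ignored.

```python
def ind_holds(dep_rows, dep_cols, ref_rows, ref_cols):
    """Check whether the IND dep[dep_cols] ⊆ ref[ref_cols] holds.

    Rows are dicts; the column lists must have equal length (the arity n).
    """
    # Project the referenced table onto its columns as a set of tuples.
    ref_proj = {tuple(row[c] for c in ref_cols) for row in ref_rows}
    # The IND is valid if every projected dependent tuple occurs in that set.
    return all(tuple(row[c] for c in dep_cols) in ref_proj for row in dep_rows)


# Hypothetical sample data, for illustration only.
orders = [{"cust": 1, "city": "Bonn"}, {"cust": 2, "city": "Kiel"}]
customers = [
    {"id": 1, "town": "Bonn"},
    {"id": 2, "town": "Kiel"},
    {"id": 3, "town": "Ulm"},
]

unary_ok = ind_holds(orders, ["cust"], customers, ["id"])
binary_ok = ind_holds(orders, ["cust", "city"], customers, ["id", "town"])
reverse_ok = ind_holds(customers, ["id"], orders, ["cust"])
```

Each call corresponds to one data access per candidate, which is the cost that grows exponentially with arity in generate-and-test approaches.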

Cite

Shaabani, N., & Meinel, C. (2016). Detecting maximum inclusion dependencies without candidate generation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9828 LNCS, pp. 118–133). Springer Verlag. https://doi.org/10.1007/978-3-319-44406-2_10
