Detecting maximum inclusion dependencies without candidate generation

6Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Inclusion dependencies (INDs) within and across databases are an important relationship for many applications in data integration, schema (re-)design, integrity checking, or query optimization. Existing techniques for detecting all INDs need to generate IND candidates and test their validity in the given data instance. However, the major disadvantage of this approach is the exponentially growing number of data accesses in terms of the number of SQL queries as well as I/O operations. We introduce Mind2, a new approach for detecting n-ary INDs (n > 1) without any candidate generation. Mind2implements a new characterization of the maximum INDs we developed in this paper. This characterization is based on set operations defined on certain metadata that Mind2generates by accessing the database only 2 × the number of valid unary INDs. Thus, Mind2eliminates the exponential number of data accesses needed by existing approaches. Furthermore, the experiments show that Mind2 is significantly more scalable than hypergraph-based approaches.

Cite

CITATION STYLE

APA

Shaabani, N., & Meinel, C. (2016). Detecting maximum inclusion dependencies without candidate generation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9828 LNCS, pp. 118–133). Springer Verlag. https://doi.org/10.1007/978-3-319-44406-2_10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free