Abstract
In many applications including loosely coupled cloud databases, collaborative editing and network monitoring, data from multiple sources is regularly used for query answering. For reasons such as system failures, insufficient author knowledge or network issues, data may be temporarily unavailable or generally nonexistent. Hence, not all data needed for query answering may be available. In this paper, we propose a natural class of completeness patterns, expressed by selections on database tables, to specify complete parts of database tables. We then show how to adapt the operators of relational algebra so that they manipulate these completeness patterns to compute completeness patterns pertaining to query answers. Our proposed algebra is computationally sound and complete with respect to the information that the patterns provide. We show that stronger completeness patterns can be obtained by considering not only the schema but also the database instance and we extend the algebra to take into account this additional information. We develop novel techniques to efficiently implement the computation of completeness patterns on query answers and demonstrate their scalability on real data.
Cite
CITATION STYLE
Razniewski, S., Korn, F., Nutt, W., & Srivastava, D. (2015). Identifying the extent of completeness of query answers over partially complete databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Vol. 2015-May, pp. 561–576). Association for Computing Machinery. https://doi.org/10.1145/2723372.2750544
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.