In an uncertain data set S = (S, p, f) where S is the ground set consisting of n elements, p : S → [0,1] a probability function, and f : S → ℝ a score function, each element i ∈ S with score f(i) appears independently with probability p(i). The top-k query on S asks for the set of k elements that has the maximum probability of appearing to be the k elements with the highest scores in a random instance of S. Computing the top-k answer on a fixed S is known to be easy. In this paper, we consider the dynamic problem, that is, how to maintain the top-k query answer when S changes, including element insertion and deletions in the ground set S, changes in the probability function p and the score function f. We present a fully dynamic data structure that handles an update in O(k log k log n) time, and answers a top-j query in O(log n+j) time for any j ≤ k. The structure has O(n) size and can be constructed in O(n log2 k) time. As a building block of our dynamic structure, we present an algorithm for the all-top-k problem, that is, computing the top-j answers for all j = 1,..., k, which may be of independent interest. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Chen, J., & Yi, K. (2007). Dynamic structures for top-k queries on uncertain data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4835 LNCS, pp. 427–438). Springer Verlag. https://doi.org/10.1007/978-3-540-77120-3_38
Mendeley helps you to discover research relevant for your work.