Efficient rank based KNN query processing over uncertain data

Ying Zhang; Xuemin Lin; Gaoping Zhu; Wenjie Zhang; Qianlu Lin

Conference Proceedings

Efficient rank based KNN query processing over uncertain data

Proceedings - International Conference on Data Engineering (2010) 28-39

DOI: 10.1109/ICDE.2010.5447874

29Citations

35Readers

Get full text

Abstract

Uncertain data are inherent in many applications such as environmental surveillance and quantitative economics research. As an important problem in many applications, KNN query has been extensively investigated in the literature. In this paper, we study the problem of processing rank based KNN query against uncertain data. Besides applying the expected rank semantic to compute KNN, we also introduce the median rank which is less sensitive to the outliers. We show both ranking methods satisfy nice top-k properties such as exact-k, containment, unique ranking, value invariance, stability and fairfulness. For given query q, IO and CPU efficient algorithms are proposed in the paper to compute KNN based on expected (median) ranks of the uncertain objects. To tackle the correlations of the uncertain objects and high IO cost caused by large number of instances of the uncertain objects, randomized algorithms are proposed to approximately compute KNN with theoretical guarantees. Comprehensive experiments are conducted on both real and synthetic data to demonstrate the efficiency of our techniques. © 2010 IEEE.

Cite

CITATION STYLE

APA

Zhang, Y., Lin, X., Zhu, G., Zhang, W., & Lin, Q. (2010). Efficient rank based KNN query processing over uncertain data. In Proceedings - International Conference on Data Engineering (pp. 28–39). https://doi.org/10.1109/ICDE.2010.5447874

Efficient rank based KNN query processing over uncertain data

Abstract

Cite

Register to see more suggestions