RkNN query processing in distributed spatial infrastructures: A performance study

Francisco García-García; Antonio Corral; Luis Iribarne; Michael Vassilakopoulos

Conference Proceedings

RkNN query processing in distributed spatial infrastructures: A performance study

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10563 LNCS 200-207

DOI: 10.1007/978-3-319-66854-3_15

4Citations

14Readers

Get full text

Abstract

The Reverse k-Nearest Neighbor (RkNN) problem, i.e. finding all objects in a dataset that have a given query point among their corresponding k-nearest neighbors, has received increasing attention in the past years. RkNN queries are of particular interest in a wide range of applications such as decision support systems, resource allocation, profile-based marketing, location-based services, etc. With the current increasing volume of spatial data, it is difficult to perform RkNN queries efficiently in spatial data-intensive applications, because of the limited computational capability and storage resources. In this paper, we investigate how to design and implement distributed RkNN query algorithms using shared-nothing spatial cloud infrastructures as SpatialHadoop and LocationSpark. SpatialHadoop is a framework that inherently supports spatial indexing on top of Hadoop to perform efficiently spatial queries. LocationSpark is a recent spatial data processing system built on top of Spark. We have evaluated the performance of the distributed RkNN query algorithms on both SpatialHadoop and LocationSpark with big real-world datasets. The experiments have demonstrated the efficiency and scalability of our proposal in both distributed spatial data management systems, showing the performance advantages of LocationSpark.

Author supplied keywords

Cite

CITATION STYLE

APA

García-García, F., Corral, A., Iribarne, L., & Vassilakopoulos, M. (2017). RkNN query processing in distributed spatial infrastructures: A performance study. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10563 LNCS, pp. 200–207). Springer Verlag. https://doi.org/10.1007/978-3-319-66854-3_15

RkNN query processing in distributed spatial infrastructures: A performance study

Abstract

Author supplied keywords

Cite

Register to see more suggestions