Processing aggregate queries in a federation of SPARQL endpoints

Dilshod Ibragimov; Katja Hose; Torben Bach Pedersen; Esteban Zimányi

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9088 269-285

DOI: 10.1007/978-3-319-18818-8_17

18 Citations

7 Readers

Abstract

More and more RDF data is exposed on the Web via SPARQL endpoints. With the recent SPARQL 1.1 standard, these datasets can be queried in novel and more powerful ways; for example, complex analysis tasks involving grouping and aggregation, even over data from multiple SPARQL endpoints, can now be formulated in a single query. This enables Business Intelligence applications that access data from federated web sources and can combine it with local data. However, as both aggregate and federated queries have become available only recently, state-of-the-art systems lack sophisticated optimization techniques that facilitate efficient execution of such queries over large datasets. To overcome these shortcomings, we propose a set of query processing strategies and the associated Cost-based Optimizer for Distributed Aggregate queries (CoDA) for executing aggregate SPARQL queries over federations of SPARQL endpoints. Our comprehensive experiments show that CoDA significantly improves performance over current state-of-the-art systems.
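To make the kind of query discussed in the abstract concrete, the following is a minimal sketch of a SPARQL 1.1 query that combines federation (the `SERVICE` keyword) with grouping and aggregation (`GROUP BY`, `SUM`). It is not taken from the paper and does not show CoDA itself: the endpoint URLs, the `ex:` vocabulary, and the helper name `build_federated_aggregate_query` are all assumptions chosen for illustration.

```python
# Illustrative only: the endpoint URLs and the ex: vocabulary below are
# hypothetical and do not come from the paper.
SALES_ENDPOINT = "http://example.org/sales/sparql"
PRODUCTS_ENDPOINT = "http://example.org/products/sparql"


def build_federated_aggregate_query(sales_endpoint: str, products_endpoint: str) -> str:
    """Return the text of a SPARQL 1.1 query that joins data from two remote
    endpoints via SERVICE and aggregates the joined rows with GROUP BY / SUM."""
    return f"""
PREFIX ex: <http://example.org/schema#>
SELECT ?category (SUM(?amount) AS ?total)
WHERE {{
  SERVICE <{sales_endpoint}> {{
    ?sale ex:product ?product ;
          ex:amount  ?amount .
  }}
  SERVICE <{products_endpoint}> {{
    ?product ex:category ?category .
  }}
}}
GROUP BY ?category
""".strip()


# Build and display the query text; sending it to a real endpoint is out of scope here.
query = build_federated_aggregate_query(SALES_ENDPOINT, PRODUCTS_ENDPOINT)
print(query)
```

A federated query engine receiving such a query has to decide where the join and the aggregation are evaluated and how much intermediate data is shipped between endpoints, which is the kind of decision a cost-based optimizer is meant to inform.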

Cite

Ibragimov, D., Hose, K., Pedersen, T. B., & Zimányi, E. (2015). Processing aggregate queries in a federation of SPARQL endpoints. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9088, pp. 269–285). Springer Verlag. https://doi.org/10.1007/978-3-319-18818-8_17
