Federated SPARQL queries processing with replicated fragments

Gabriela Montoya; Hala Skaf-Molli; Pascal Molli; Maria Esther Vidal

Conference ProceedingsOPEN ACCESS

Federated SPARQL queries processing with replicated fragments

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9366 36-51

DOI: 10.1007/978-3-319-25007-6_3

17Citations

27Readers

Get full text

Abstract

Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.

Author supplied keywords

Cite

CITATION STYLE

APA

Montoya, G., Skaf-Molli, H., Molli, P., & Vidal, M. E. (2015). Federated SPARQL queries processing with replicated fragments. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9366, pp. 36–51). Springer Verlag. https://doi.org/10.1007/978-3-319-25007-6_3

Federated SPARQL queries processing with replicated fragments

Abstract

Author supplied keywords

Cite

Register to see more suggestions