TripleCloud: An infrastructure for exploratory querying over Web-scale RDF data

by Christophe Guéret, Spyros Kotoulas, Paul Groth
Proceedings of Workshop on Webscale Knowledge Representation Retrieval and Reasoning WebKR3 2011 (2011)

Abstract

As the availability of large-scale RDF data sets has grown, there has been a corresponding growth in researchers' and practitioners' interest in analyzing and investigating these data sets. However, given their size and messiness, there is significant overhead in setting up the infrastructure to store and query them. In this paper, we present TripleCloud, a system that aims to lower the entry cost of exploring Web-scale RDF data sets. The system takes advantage of existing cloud-based key-value stores (e.g. BigTable, HBase) both to enable scalability and to hide the complexities of infrastructure deployment and maintenance. On top of these key-value stores, it layers a robust query engine able to return approximate answers. We test the scalability of the approach, scaling to over 3 billion triples for complex queries. In addition to an implementation over HBase, TripleCloud runs on Google App Engine, allowing us to perform a cost evaluation of the approach.
