Skip to content
Conference proceedings

OWL reasoning with WebPIE: Calculating the closure of 100 billion triples

Urbani J, Kotoulas S, Maassen J, Van Harmelen F, Bal H...(+5 more)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6088 LNCS, issue PART 1 (2010) pp. 213-227

  • 86


    Mendeley users who have this article in their library.
  • 76


    Citations of this article.
  • N/A


    ScienceDirect users who have downloaded this article.
Sign in to save reference


In previous work we have shown that the MapReduce framework for distributed computation can be deployed for highly scalable inference over RDF graphs under the RDF Schema semantics. Unfortunately, several key optimizations that enabled the scalable RDFS inference do not generalize to the richer OWL semantics. In this paper we analyze these problems, and we propose solutions to overcome them. Our solutions allow distributed computation of the closure of an RDF graph under the OWL Horst semantics. We demonstrate the WebPIE inference engine, built on top of the Hadoop platform and deployed on a compute cluster of 64 machines. We have evaluated our approach using some real-world datasets (UniProt and LDSR, about 0.9-1.5 billion triples) and a synthetic benchmark (LUBM, up to 100 billion triples). Results show that our implementation is scalable and vastly outperforms current systems when comparing supported language expressivity, maximum data size and inference speed.

Find this document

Get full text

Cite this document

Choose a citation style from the tabs below