MapReduce–based bulk–loading algorithm for fast search for billions of triples

Jung Ho Um; Seungwoo Lee; Tae Hong Kim; Chang Hoo Jeong; Kwangik Seo; Joonho Park; Hanmin Jung

Conference Proceedings

MapReduce–based bulk–loading algorithm for fast search for billions of triples

Lecture Notes in Electrical Engineering (2014) 330 1139-1145

DOI: 10.1007/978-3-662-45402-2_161

2Citations

2Readers

Get full text

Abstract

Due to the development of IT and scientific technology, huge amounts of data are continuously being created and the big data era can be said to have arrived. Therefore, triple store inserting and inquiring into knowledge bases has to be scaled up in order to deal with such large sources of data. To this end, we propose a triple store system based on a distributed database that uses bulk-loading for billions of triples to store data and to respond to user queries quickly. In order to achieve this purpose, we introduce a bulk-loading algorithm using the MapReduce framework and the SPARQL query processing engine to connect to a large distributed database. Experimental results show that the proposed bulk-loading algorithm can use 101K triples per second to load approximately 33 billion triples. This implies that we will be able to deal with billions of triples.

Author supplied keywords

Cite

CITATION STYLE

APA

Um, J. H., Lee, S., Kim, T. H., Jeong, C. H., Seo, K., Park, J., & Jung, H. (2014). MapReduce–based bulk–loading algorithm for fast search for billions of triples. In Lecture Notes in Electrical Engineering (Vol. 330, pp. 1139–1145). Springer Verlag. https://doi.org/10.1007/978-3-662-45402-2_161

MapReduce–based bulk–loading algorithm for fast search for billions of triples

Abstract

Author supplied keywords

Cite

Register to see more suggestions