Due to the development of IT and scientific technology, huge amounts of data are continuously being created and the big data era can be said to have arrived. Therefore, triple store inserting and inquiring into knowledge bases has to be scaled up in order to deal with such large sources of data. To this end, we propose a triple store system based on a distributed database that uses bulk-loading for billions of triples to store data and to respond to user queries quickly. In order to achieve this purpose, we introduce a bulk-loading algorithm using the MapReduce framework and the SPARQL query processing engine to connect to a large distributed database. Experimental results show that the proposed bulk-loading algorithm can use 101K triples per second to load approximately 33 billion triples. This implies that we will be able to deal with billions of triples.
CITATION STYLE
Um, J. H., Lee, S., Kim, T. H., Jeong, C. H., Seo, K., Park, J., & Jung, H. (2014). MapReduce–based bulk–loading algorithm for fast search for billions of triples. In Lecture Notes in Electrical Engineering (Vol. 330, pp. 1139–1145). Springer Verlag. https://doi.org/10.1007/978-3-662-45402-2_161
Mendeley helps you to discover research relevant for your work.