MultiCrawler: A pipelined architecture for crawling and indexing semantic web data

Andreas Harth; Jürgen Umbrich; Stefan Decker

Conference ProceedingsOPEN ACCESS

MultiCrawler: A pipelined architecture for crawling and indexing semantic web data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4273 LNCS 258-271

DOI: 10.1007/11926078_19

47Citations

143Readers

Abstract

The goal of the work presented in this paper is to obtain large amounts of semistructured data from the web. Harvesting semistructured data is a prerequisite to enabling large-scale query answering over web sources. We contrast our approach to conventional web crawlers, and describe and evaluate a five-step pipelined architecture to crawl and index data from both the traditional and the Semantic Web. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Harth, A., Umbrich, J., & Decker, S. (2006). MultiCrawler: A pipelined architecture for crawling and indexing semantic web data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4273 LNCS, pp. 258–271). Springer Verlag. https://doi.org/10.1007/11926078_19

MultiCrawler: A pipelined architecture for crawling and indexing semantic web data

Abstract

Cite

Register to see more suggestions