A selection algorithm for focused crawlers incorporating semantic metadata

Saurabh Wadwekar; Debajyoti Mukhopadhyay

Conference Proceedings

A selection algorithm for focused crawlers incorporating semantic metadata

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7753 LNCS 561-572

DOI: 10.1007/978-3-642-36071-8_45

0Citations

2Readers

Get full text

Abstract

The search results offered currently by majority of search portals are horizontal by nature. This denotes that these search engines intend to index as much web pages as possible and present search results based on these web pages. These results often offer generalized results. Focused Crawlers were built to download web pages relevant only to a pre-specified topic. Searching on these kinds of pages is called as Vertical Search, as it attempts to drill down on a single topic, rather than exploring a plethora of other pages on web which are related to search query in one way or another. In this paper, we propose an algorithm which helps a focused crawler decide whether a web page should be downloaded on not. The selection algorithm proposed in this paper makes use of semantic properties of the content to arrive at a decision. © 2013 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Wadwekar, S., & Mukhopadhyay, D. (2013). A selection algorithm for focused crawlers incorporating semantic metadata. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7753 LNCS, pp. 561–572). Springer Verlag. https://doi.org/10.1007/978-3-642-36071-8_45

A selection algorithm for focused crawlers incorporating semantic metadata

Abstract

Author supplied keywords

Cite

Register to see more suggestions