Focused crawling using latent semantic indexing - An application for vertical search engines

7Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Vertical search engines and web portals are gaining ground over the general-purpose engines due to their limited size and their high precision for the domain they cover. The number of vertical portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler evident. In this paper, we develop a latent semantic indexing classifier that combines link analysis with text content in order to retrieve and index domain specific web documents. We compare its efficiency with other well-known web information retrieval techniques. Our implementation presents a different approach to focused crawling and aims to overcome the size limitations of the initial training data while maintaining a high recall/precision ratio. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Almpanidis, G., Kotropoulos, C., & Pitas, I. (2005). Focused crawling using latent semantic indexing - An application for vertical search engines. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3652 LNCS, pp. 402–413). https://doi.org/10.1007/11551362_36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free