We present a report on our participation in the mixed monolingual web task of the 2005 Cross-Language Evaluation Forum (CLEF). We compared the result of web page retrieval based on the page content, page title, and a combination of page content and page title. The result shows that using the combination of page title resulted in the best retrieval performance compared to using only page content or page title. Taking into account the number of links referring to a web page and the depth of the directory path in its URL did not result in any significant improvement to the retrieval performance. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Adriani, M., & Pandugita, R. (2006). Using the Web information structure for retrieving Web pages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4022 LNCS, pp. 892–897). Springer Verlag. https://doi.org/10.1007/11878773_99
Mendeley helps you to discover research relevant for your work.