This paper reports a set of experiments that explore how useful the structural information of WWW documents is for text classification. Our working hypothesis is that it is often easier to classify a hypertext page using information on the pages that point to it than using information on the page itself. We present experimental evidence that confirms this hypothesis on a set of Web pages that relate to Computer Science departments.
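To make the hypothesis concrete, the following is a minimal sketch of classifying a page from the text of links that point to it. It is not the method evaluated in the paper: the toy link data, the page names, the class labels, the choice of anchor text as the only feature, and the naive Bayes classifier are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: (source page, anchor text of the link, target page).
LINKS = [
    ("dept/index.html", "faculty member homepage", "people/smith.html"),
    ("dept/people.html", "professor smith", "people/smith.html"),
    ("dept/courses.html", "course syllabus", "cs101/index.html"),
    ("dept/index.html", "course lecture notes", "cs101/index.html"),
    ("dept/people.html", "professor jones", "people/jones.html"),
    ("dept/courses.html", "lecture notes and syllabus", "cs202/index.html"),
]

# Hypothetical labels for the pages used as training examples.
LABELS = {"people/smith.html": "faculty", "cs101/index.html": "course"}


def inlink_text(links):
    """Collect, for each target page, the words of all anchors pointing to it."""
    words = defaultdict(list)
    for _source, anchor, target in links:
        words[target].extend(anchor.lower().split())
    return words


def train(features, labels):
    """Count words per class for a multinomial naive Bayes model."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    vocab = set()
    for page, label in labels.items():
        class_counts[label] += 1
        word_counts[label].update(features[page])
        vocab.update(features[page])
    return word_counts, class_counts, vocab


def classify(words, model):
    """Return the class with the highest add-one-smoothed log probability."""
    word_counts, class_counts, vocab = model
    total_pages = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        total_words = sum(word_counts[label].values())
        score = math.log(class_counts[label] / total_pages)
        for word in words:
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


features = inlink_text(LINKS)
model = train(features, LABELS)

# The unlabeled pages are classified without reading their own content.
for page in ("people/jones.html", "cs202/index.html"):
    print(page, classify(features[page], model))
```

In this sketch the target page's own text is never read; only the words that other pages use when linking to it contribute to the decision, which is the kind of structural information the hypothesis refers to.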
CITATION STYLE
Fürnkranz, J. (1999). Exploiting structural information for text classification on the WWW. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1642, pp. 487–497). Springer Verlag. https://doi.org/10.1007/3-540-48412-4_41