Exploiting structural information for text classification on the WWW

Johannes Fürnkranz

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1999) 1642 487-497

DOI: 10.1007/3-540-48412-4_41

82 Citations

32 Readers

Abstract

In this paper, we report on a set of experiments that explore the utility of making use of the structural information of WWW documents. Our working hypothesis is that it is often easier to classify a hypertext page using information provided on pages that point to it instead of using information that is provided on the page itself. We present experimental evidence that confirms this hypothesis on a set of Web-pages that relate to Computer Science Departments.

APA

Fürnkranz, J. (1999). Exploiting structural information for text classification on the WWW. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1642, pp. 487–497). Springer Verlag. https://doi.org/10.1007/3-540-48412-4_41
