Web page classification based on k-nearest neighbor approach

59Citations
Citations of this article
31Readers
Mendeley users who have this article in their library.

Abstract

Automatic categorization is the only viable method to deal with the scaling problem of the World Wide Web. In this paper, we propose a Web page classifier based on an adaptation of k-Nearest Neighbor (k- NN) approach. To improve the performance of k-NN approach, we supplement k-NN approach with a feature selection method and a term-weighting scheme using markup tags, and reform documentdocument similarity measure used in vector space model. In our experiments on a Korean commercial Web directory, our proposed methods in k-NN approach for Web page classification improved the performance of classification.

Cite

CITATION STYLE

APA

Kwon, O. W., & Lee, J. H. (2000). Web page classification based on k-nearest neighbor approach. In Proceedings of the 5th international Workshop on Information Retrieval with Asian Languages, IRAL 2000 (pp. 9–15). Association for Computing Machinery, Inc. https://doi.org/10.1145/355214.355216

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free