A novel framework for web page classification using two-stage neural network

3Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Web page classification is one of the essential techniques for Web mining. This paper presents a framework for Web page classification. It is hybrid architecture of neural network PCA (principle components analysis) and SOFM (self-organizing map). In order to perform the classification, a web page is firstly represented by a vector of features with different weights according to the term frequency and the importance of each sentence in the page. As the number of the features is big, PCA is used to select the relevant features. Finally the output of PCA is sent to SOFM for classification. To compare with the proposed framework, two conventional classifiers are used in our experiments: k-NN and Naïve Bayes. Our new method makes a significant improvement in classifications on both data sets compared with the two conventional methods. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Li, Y., Cao, Y., Zhu, Q., & Zhu, Z. (2005). A novel framework for web page classification using two-stage neural network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3584 LNAI, pp. 499–506). Springer Verlag. https://doi.org/10.1007/11527503_60

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free