Comprehensive analysis of web page classifier for Fsocused crawler

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Focused Crawler collects domain specific web page from the internet. However, the performance of focused web crawler depends upon the multidimensional nature of the web page. This paper presents a comprehensive analysis of recent web page classifiers for focused crawlers and also explores the impact of web-based feature in collaboration with web classifier. It also evaluates the performance of classification technique such as Support vector machine, Naive Bayes, Linear Regression and Random Forest over web page classification. Along with that it examines the impact of web feature i.e. anchor text, Page content and link over web page classification. Finally the paper yield interesting result about the collective response of web feature and classification technique for web page classification as a relevant class and irrelevant class.

Cite

CITATION STYLE

APA

Shrivastava, G. K., Kaushik, P., & Pateriya, R. K. (2019). Comprehensive analysis of web page classifier for Fsocused crawler. International Journal of Innovative Technology and Exploring Engineering, 8(9), 57–65. https://doi.org/10.35940/ijitee.i7477.078919

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free