A segmentation method for web page analysis using shrinking and dividing

33Citations
Citations of this article
32Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

On the basis of image processing technology and characteristics of web pages, a new web segmentation method - iterated shrinking and dividing is proposed in this paper. Dividing conditions and concept of dividing zone are introduced, based on which web page image is divided into visually consentaneous sub-images by shrinking and splitting iteratively. First, the web page is saved as image that is preprocessed by edge detection algorithm such as Canny. Then dividing zones are detected and the web image is segmented repeatedly until all blocks are indivisible. This method can be used to analyse the web pages such as detecting similar visual layout. Experiments show that the algorithm is suitable for web page segmentation, and does well in expansibility and performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Cao, J., Mao, B., & Luo, J. (2010). A segmentation method for web page analysis using shrinking and dividing. International Journal of Parallel, Emergent and Distributed Systems, 25(2), 93–104. https://doi.org/10.1080/17445760802429585

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free