A visual based page segmentation for deep web data extraction

Vikas R. Palekar

Conference Proceedings

A visual based page segmentation for deep web data extraction

Palekar V

Advances in Intelligent and Soft Computing (2012) 131 AISC(VOL. 2) 791-804

DOI: 10.1007/978-81-322-0491-6_72

0Citations

2Readers

Get full text

Abstract

A new web content structure analysis based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and automatic page adaptation can benefit from this structure. This paper presents an automatic top-down, tag-tree independent approach to detect web content structure. It simulates how a user understands web layout structure based on his visual perception. Comparing to other existing techniques such as DOM tree, our approach is independent to the HTML documentation representation. Our method can work well even when the HTML structure is quite different from the visual layout structure. © 2012 Springer India Pvt. Ltd.

Author supplied keywords

Cite

CITATION STYLE

APA

Palekar, V. R. (2012). A visual based page segmentation for deep web data extraction. In Advances in Intelligent and Soft Computing (Vol. 131 AISC, pp. 791–804). https://doi.org/10.1007/978-81-322-0491-6_72

A visual based page segmentation for deep web data extraction

Abstract

Author supplied keywords

Cite

Register to see more suggestions