Robin: Extracting visual and textual features from Web pages

Mizuki Oka; Hiroshi Tsukada; Kazuhiko Kato

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 3841 LNCS, pp. 765–771 (2006)

DOI: 10.1007/11610113_71

Abstract

Web pages contain information in several forms. These include textual information such as words and visual information such as images, use of color, and layout. We propose a method of extracting the characteristic features from both the textual and visual information in Web pages. Our method enables seamless integration of the two types of information and automatic extraction of their characteristic features. Based on this method, we developed a proof-of-concept system called Robin, which is designed to provide users with an intuitive way of browsing search engine results. The results of an experimental evaluation of the system showed that it has the potential to be practical and effective. © Springer-Verlag Berlin Heidelberg 2006.

Citation (APA)

Oka, M., Tsukada, H., & Kato, K. (2006). Robin: Extracting visual and textual features from Web pages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3841 LNCS, pp. 765–771). https://doi.org/10.1007/11610113_71
