Web pages contain information in several forms. These include textual information such as words and visual information such as images, use of color, and layout. We propose a method of extracting the characteristic features from both the textual and visual information in Web pages. Our method enables seamless integration of the two types of information and automatic extraction of their characteristic features. Based on this method, we developed a proof-of-concept system called Robin, which is designed to provide users with an intuitive way of browsing search engine results. The results of an experimental evaluation of the system showed that it has the potential to be practical and effective. © Springer-Verlag Berlin Heidelberg 2006.
CITATION
Oka, M., Tsukada, H., & Kato, K. (2006). Robin: Extracting visual and textual features from Web pages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3841 LNCS, pp. 765–771). https://doi.org/10.1007/11610113_71