We consider the problem of extracting texts related to a given keyword from Web pages collected by a search engine. Recently, we proposed a method using both structural and content information[1, 2]. In our previous paper, we reported good extraction performance of our method only for Ramen-shop dataset written in Japanese. In this paper, we examine it for datasets of other kind of restaurants, and also for a dataset written in English. We discuss some modification for performance improvement. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Hasegawa, H., Kudo, M., & Nakamura, A. (2005). Empirical study on usefulness of algorithm SACwRApper for reputation extraction from the WWW. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3684 LNAI, pp. 668–674). Springer Verlag. https://doi.org/10.1007/11554028_93
Mendeley helps you to discover research relevant for your work.