An interface agent for wrapper-based information extraction

Jaeyoung Yang; Tae Hyung Kim; Joongmin Choi

Conference Proceedings

An interface agent for wrapper-based information extraction

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2005) 3371 291-302

DOI: 10.1007/978-3-540-32128-6_22

6Citations

5Readers

Get full text

Abstract

This paper proposes a new method of building information extraction rules for Web documents by exploiting a user interface agent that combines the manual and automatic approaches of rule generation. We adopt the scheme of supervised learning in which the interface agent is designed to get information from the user regarding what to extract from a document and XML-based wrappers are generated according to these inputs. The interface agent is used not only to generate new extraction rules but also to modify and extend existing ones to enhance the precision and the recall measures of Web information extraction systems. We have done a series of experiments to test the system, and the results are very promising. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Yang, J., Kim, T. H., & Choi, J. (2005). An interface agent for wrapper-based information extraction. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3371, pp. 291–302). Springer Verlag. https://doi.org/10.1007/978-3-540-32128-6_22

An interface agent for wrapper-based information extraction

Abstract

Cite

Register to see more suggestions