We propose PIES, a new web information extraction system that converts web information into XML documents. PIES takes two user-specified inputs: an ontology and a set of HTML tag pattern descriptions. The pattern descriptions extract information from web pages, and the ontology validates what they extract. We designed a new language for describing HTML tag patterns and extraction rules. We implemented PIES and evaluated it on the US patent web site. © Springer-Verlag Berlin Heidelberg 2005.
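The extract-then-validate flow described in the abstract can be illustrated with a minimal sketch. It is not the PIES pattern language or implementation, which the abstract does not specify. The regular-expression tag patterns, the predicate-based ontology, the concept names, and the `extract` function are all hypothetical stand-ins chosen for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical ontology: concept name -> validation predicate.
# (An assumption for illustration; not the ontology format used by PIES.)
ONTOLOGY = {
    "title": lambda value: len(value) > 0,
    "patent_number": lambda value: re.fullmatch(r"[\d,]+", value) is not None,
}

# Hypothetical tag patterns: concept name -> regex with one capture group.
# (Regexes stand in for the paper's tag pattern language, which is not shown.)
TAG_PATTERNS = {
    "title": r'<font size="\+1">(.*?)</font>',
    "patent_number": r"<b>United States Patent</b>\s*<b>(.*?)</b>",
}


def extract(html, root_tag="record"):
    """Extract values with tag patterns, keep those the ontology accepts,
    and return the result as an XML string."""
    root = ET.Element(root_tag)
    for concept, pattern in TAG_PATTERNS.items():
        match = re.search(pattern, html, flags=re.S | re.I)
        if match is None:
            continue  # pattern did not occur in this page
        value = match.group(1).strip()
        if ONTOLOGY[concept](value):  # ontology-based validation step
            ET.SubElement(root, concept).text = value
    return ET.tostring(root, encoding="unicode")


sample = ('<b>United States Patent</b> <b>6,000,000</b>'
          '<font size="+1"> Sample title </font>')
print(extract(sample))
```

In this sketch a value that matches a tag pattern but fails its ontology check is simply dropped from the XML output. How PIES itself handles rejected values is not stated in the abstract.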
CITATION STYLE
Park, B. K., Han, H., & Song, I. Y. (2005). PIES: A web information extraction system using ontology and tag patterns. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3739 LNCS, pp. 688–693). https://doi.org/10.1007/11563952_65