This paper presents an approach for applying inductive logic programming to information extraction from HTML documents structured as unranked ordered trees. We consider information extraction from Web resources that are abstracted as providing sets of tuples. Our approach is based on defining a new class of wrappers, called logic wrappers, as a special class of logic programs. The approach is demonstrated with examples and experimental results in the area of collecting product information, highlighting the advantages and the limitations of the method. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Bǎdicǎ, C., Bǎdicǎ, A., & Popescu, E. (2005). Tuples extraction from HTML using logic wrappers and inductive logic programming. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3528 LNAI, pp. 44–50). Springer Verlag. https://doi.org/10.1007/11495772_8