Declarative information extraction, Web crawling, and recursive wrapping with Lixto

Abstract

Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting information from Web pages using such wrappers, and for translating the extracted content into XML. This paper describes some advanced features of Lixto, such as disjunctive pattern definitions, specialization rules, and Lixto's capability of collecting and aggregating information from several linked Web pages. © Springer-Verlag Berlin Heidelberg 2001.
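To make the idea of a wrapper with a disjunctive pattern definition concrete, the following is a minimal sketch in Python. It is not Lixto's own wrapper language or API: the sample page, the regular-expression rules, and the names `PRICE_RULES`, `first_match`, and `wrap` are all assumptions made up for illustration. It only shows the general shape of the task the abstract describes: a pattern ("price") defined by alternative rules, applied to a Web page, with the extracted content translated into XML.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample page (not from the paper): two rows mark up
# the price in two different ways.
PAGE = """
<table>
  <tr><td class="item">Laptop</td><td class="price">EUR 900</td></tr>
  <tr><td class="item">Phone</td><td><b>$ 300</b></td></tr>
</table>
"""

ROW_RULE = re.compile(r"<tr>(.*?)</tr>", re.S)
ITEM_RULES = [re.compile(r'<td class="item">([^<]+)</td>')]

# Disjunctive pattern: a price is whatever EITHER rule matches.
PRICE_RULES = [
    re.compile(r'<td class="price">([^<]+)</td>'),
    re.compile(r"<td><b>([^<]+)</b></td>"),
]


def first_match(rules, text):
    """Return the text captured by the first rule that matches, else None."""
    for rule in rules:
        match = rule.search(text)
        if match:
            return match.group(1).strip()
    return None


def wrap(page):
    """Extract item/price pairs from the page and build an XML tree."""
    root = ET.Element("offers")
    for row in ROW_RULE.findall(page):
        item = first_match(ITEM_RULES, row)
        price = first_match(PRICE_RULES, row)
        if item is None or price is None:
            continue  # skip rows that no rule covers
        offer = ET.SubElement(root, "offer")
        ET.SubElement(offer, "item").text = item
        ET.SubElement(offer, "price").text = price
    return root


xml_text = ET.tostring(wrap(PAGE), encoding="unicode")
print(xml_text)
```

Regular expressions keep the sketch short; a wrapper for real pages would more plausibly operate on the parsed document tree, and collecting information from linked pages would additionally require following the extracted URLs.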

Cite

Baumgartner, R., Flesca, S., & Gottlob, G. (2001). Declarative information extraction, Web crawling, and recursive wrapping with Lixto. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2173 LNAI, pp. 21–41). https://doi.org/10.1007/3-540-45402-0_2
