WetDL: A web information extraction language

Benjamin Habegger; Mohamed Quafafou

Journal Article

WetDL: A web information extraction language

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3261 128-138

DOI: 10.1007/978-3-540-30198-1_14

3Citations

3Readers

Get full text

Abstract

Many online information sources are available on the Web. Giving machine access to such sources leads to many interesting applications, such as using web data in mediators or software agents. Up to now most work in the field of information extraction from the web has concentrated on building wrappers, i.e. programs allowing to reformat presentational data in HTML into a more machine comprehensible format. While being an important part of a web information extraction application such wrappers are not sufficient to fully access a source. Indeed, it is necessary to setup an infrastructure allowing to build queries, fetch pages, extract specific links, etc. In this paper we propose a language called WetDL allowing to describe an information extraction task as a network of operators whose execution performs the desired extraction task. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Habegger, B., & Quafafou, M. (2004). WetDL: A web information extraction language. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3261, 128–138. https://doi.org/10.1007/978-3-540-30198-1_14

WetDL: A web information extraction language

Abstract

Cite

Register to see more suggestions