Related work

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In the literature, different approaches have been proposed to address the problem of extracting valuable data from the Web. In this chapter is presented an overview of such approaches. It begins by presenting a broad set of Web extraction methods and tools. Following a taxonomy previously used in the literature (Laender et al. 2002), they are divided into distinct groups according to their main approach. These groups are: Languages for Wrapper Development, Wrapper Induction Methods, NLP-based Methods, Ontology-based Methods, and HTML-aware Methods. Next, it is specifically presented probabilistic graph-based methods, supervised and unsupervised, and discusses their main characteristics in comparison to the unsupervised approach presented in this book.

Cite

CITATION STYLE

APA

Cortez, E., & da Silva, A. S. (2013). Related work. In SpringerBriefs in Computer Science (Vol. 0, pp. 9–17). Springer. https://doi.org/10.1007/978-3-319-02597-1_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free