Related work

Eli Cortez; Altigran S. da Silva

Book Chapter

Springer (2013), pp. 9–17

DOI: 10.1007/978-3-319-02597-1_2

Abstract

Different approaches have been proposed in the literature to address the problem of extracting valuable data from the Web. This chapter presents an overview of such approaches. It begins by presenting a broad set of Web extraction methods and tools. Following a taxonomy previously used in the literature (Laender et al. 2002), these are divided into distinct groups according to their main approach: Languages for Wrapper Development, Wrapper Induction Methods, NLP-based Methods, Ontology-based Methods, and HTML-aware Methods. Next, the chapter presents probabilistic graph-based methods, both supervised and unsupervised, and discusses their main characteristics in comparison to the unsupervised approach presented in this book.

Cite

Cortez, E., & da Silva, A. S. (2013). Related work. In SpringerBriefs in Computer Science (Vol. 0, pp. 9–17). Springer. https://doi.org/10.1007/978-3-319-02597-1_2
