Attribute retrieval from relational web tables

Arlind Kopliku; Karen Pinel-Sauvagnat; Mohand Boughanem

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011), Vol. 7024 LNCS, pp. 117-128

DOI: 10.1007/978-3-642-24583-1_12

5 Citations

6 Readers

Abstract

In this paper, we propose an attribute retrieval approach that extracts and ranks attributes from HTML tables. Given an instance (e.g. Tower of Pisa), we want to retrieve its attributes (e.g. height, architect) from the Web. Our approach uses HTML tables, which are probably the largest source for attribute retrieval. Three recall-oriented filters are applied to tables to check three properties: (i) whether the table is relational, (ii) whether the table has a header, and (iii) whether its attributes and values conform. Candidate attributes are extracted from the tables and ranked with a combination of relevance features. Our approach can be applied to all instances and is shown to have high recall and reasonable precision. Moreover, it outperforms state-of-the-art techniques. © 2011 Springer-Verlag.
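
The filter-then-rank pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the three filters or the relevance features are computed, so every heuristic below (the row-width check, the non-numeric header check, the four-word label limit, and the single frequency score standing in for the feature combination) is a hypothetical placeholder, not the authors' method. The function names and the assumption that the first row is the header and the first column holds instance names are likewise illustrative.

```python
from collections import Counter
from typing import List, Tuple

# A table is a list of rows; each row is a list of cell strings.
# Assumption: the first row is the header and the first column names the instance.
Table = List[List[str]]


def is_relational(table: Table) -> bool:
    # Placeholder: at least 2 rows and 2 columns, and all rows have the same width.
    return (
        len(table) >= 2
        and len(table[0]) >= 2
        and all(len(row) == len(table[0]) for row in table)
    )


def has_header(table: Table) -> bool:
    # Placeholder: every header cell is non-empty and not a plain number.
    return all(
        cell.strip() and not cell.strip().replace(".", "", 1).isdigit()
        for cell in table[0]
    )


def attributes_conform(table: Table) -> bool:
    # Placeholder: header cells look like short labels (at most 4 words).
    return all(len(cell.split()) <= 4 for cell in table[0])


# Applied in order; a table must pass all three to contribute candidates.
FILTERS = [is_relational, has_header, attributes_conform]


def rank_attributes(tables: List[Table], instance: str) -> List[Tuple[str, int]]:
    """Collect header cells from tables that mention the instance and rank them.

    The score is simply the number of accepted tables in which an attribute
    appears; it stands in for the combination of relevance features.
    """
    counts: Counter = Counter()
    wanted = instance.strip().lower()
    for table in tables:
        if not all(check(table) for check in FILTERS):
            continue
        # Keep only tables whose first column contains the instance.
        if not any(row[0].strip().lower() == wanted for row in table[1:]):
            continue
        for attribute in table[0][1:]:
            counts[attribute.strip().lower()] += 1
    # Highest count first; ties broken alphabetically for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))
```

Calling `rank_attributes(tables, "Tower of Pisa")` on a small collection of parsed tables returns candidate attributes ordered by how many accepted tables list them. A fuller version would replace the frequency count with a weighted combination of several relevance scores.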

Cite

Kopliku, A., Pinel-Sauvagnat, K., & Boughanem, M. (2011). Attribute retrieval from relational web tables. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7024 LNCS, pp. 117–128). https://doi.org/10.1007/978-3-642-24583-1_12
