Abstract
The Web encompasses a significant amount of knowledge hidden in entity-attribute tables. Bridging the gap between these tables and the Web of Data thus has the potential to facilitate a large number of applications, including the augmentation of knowledge bases from tables, the search for related tables and the completion of tables using knowledge bases. Computing such bridges is impeded by the poor accuracy of automatic property mapping, the lack of approaches for the discovery of subject columns and the sheer size of table corpora. We propose Taipan, a novel approach for recovering the semantics of tables. Our approach begins by identifying subject columns using a combination of structural and semantic features. It then maps binary relations inside a table to predicates from a given knowledge base. In this way, our solution supports both table expansion and knowledge base augmentation. We evaluate our approach on a table dataset generated from real RDF data and on a manually curated version of the T2D gold standard. Our results suggest that we outperform the state of the art by up to 85% in F-measure.
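The two steps named in the abstract (subject column identification, then mapping each subject/attribute column pair to a knowledge base predicate) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the uniqueness heuristic, the majority vote, and the toy table and knowledge base (including the made-up population figures) are all assumptions chosen only to make the idea concrete. In particular, it uses a single structural feature, whereas the abstract describes a combination of structural and semantic features.

```python
from collections import Counter

def guess_subject_column(rows):
    """Hypothetical heuristic: pick the column with the highest share of
    distinct, non-numeric cells; ties go to the leftmost column."""
    n_cols = len(rows[0])
    best_idx, best_score = 0, -1.0
    for c in range(n_cols):
        cells = [r[c] for r in rows]
        # Treat cells that look like plain numbers as non-textual.
        textual = [v for v in cells if not v.replace(".", "", 1).isdigit()]
        score = len(set(textual)) / len(cells)
        if score > best_score:  # strict '>' keeps the leftmost column on ties
            best_idx, best_score = c, score
    return best_idx

def map_column_to_predicate(rows, subj, col, kb):
    """Hypothetical majority vote: return the predicate that most often
    links the subject cell to the cell in column `col` within `kb`."""
    votes = Counter()
    for r in rows:
        for pred, obj in kb.get(r[subj], []):
            if obj == r[col]:
                votes[pred] += 1
    return votes.most_common(1)[0][0] if votes else None

# Toy table and toy knowledge base (illustrative values, not real data).
TABLE = [
    ["Berlin", "Germany", "3645000"],
    ["Paris", "France", "2161000"],
    ["Madrid", "Spain", "3223000"],
]
KB = {
    "Berlin": [("country", "Germany"), ("population", "3645000")],
    "Paris": [("country", "France")],
    "Madrid": [("country", "Spain"), ("population", "3223000")],
}

subject = guess_subject_column(TABLE)
mapping = {
    col: map_column_to_predicate(TABLE, subject, col, KB)
    for col in range(len(TABLE[0]))
    if col != subject
}
```

In this toy setting the first column is chosen as the subject column, and the remaining columns are mapped by counting how often a knowledge base fact agrees with a table row. A realistic system would also need entity linking and fuzzy value matching, which this sketch omits.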
Cite
Ermilov, I., & Ngomo, A. C. N. (2016). TAIPAN: Automatic property mapping for tabular data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10024 LNAI, pp. 163–179). Springer Verlag. https://doi.org/10.1007/978-3-319-49004-5_11