This work studies methods of annotating Web tables for semantic indexing and search labeling table columns with semantic type information and linking content cells with named entities. Built on a state-of-the-art method, the focus is placed on developing and evaluating methods able to achieve the goals with partial content sampled from the table as opposed to using the entire table content as typical state-of-the-art methods would otherwise do. The method starts by annotating table columns using a sample automatically selected based on the data in the table, then using the type information to guide content cell disambiguation. Different methods of sample selection are introduced, and experiments show that they contribute to higher accuracy in cell disambiguation, comparable accuracy in column type annotation but with reduced computational overhead.
CITATION STYLE
Zhang, Z. (2014). Learning with partial data for semantic table interpretation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8876, pp. 607–618). Springer Verlag. https://doi.org/10.1007/978-3-319-13704-9_45
Mendeley helps you to discover research relevant for your work.