Learning to extract attribute values from a search engine with few examples

Xingxing Zhang; Tao Ge; Zhifang Sui

Conference Proceedings

Learning to extract attribute values from a search engine with few examples

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8202 LNAI 154-165

DOI: 10.1007/978-3-642-41491-6_15

1Citations

4Readers

Get full text

Abstract

We propose an attribute value extraction method based on analysing snippets from a search engine. First, a pattern based detector is applied to locate the candidate attribute values in snippets. Then a classifier is used to predict whether a candidate value is correct. To train such a classifier, only very few annotated triples are needed, and sufficient training data can be generated automatically by matching these triples back to snippets and titles. Finally, as a correct value may appear in multiple snippets, to exploit such redundant information, all the individual predictions are assembled together by voting. Experiments on both Chinese and English corpora in the celebrity domain demonstrate the effectiveness of our method: with only 15 annotated triples, 7 of 12 attributes' precisions are over 85%; Compared to a state-of-the-art method, 11 of 12 attributes have improvements. © Springer-Verlag 2013.

Cite

CITATION STYLE

APA

Zhang, X., Ge, T., & Sui, Z. (2013). Learning to extract attribute values from a search engine with few examples. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8202 LNAI, pp. 154–165). https://doi.org/10.1007/978-3-642-41491-6_15

Learning to extract attribute values from a search engine with few examples

Abstract

Cite

Register to see more suggestions