Abstract
State of the art set expansion algorithms produce varying quality expansions for different entity types. Even for the highest quality expansions, errors still occur and manual refinements are necessary for most practical uses. In this paper, we propose algorithms to aide this refinement process, greatly reducing the amount of manual labor required. The methods rely on the fact that most expansion errors are systematic, often stemming from the fact that some seed elements are ambiguous. Using our methods, empirical evidence shows that average R-precision over random entity sets improves by 26% to 51% when given from 5 to 10 manually tagged errors. Both proposed refinement models have linear time complexity in set size allowing for practical online use in set expansion systems. © 2009 Association for Computational Linguistics.
Cite
CITATION STYLE
Vyas, V., & Pantel, P. (2009). Semi-automatic entity set refinement. In NAACL HLT 2009 - Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 290–298). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1620754.1620796
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.