Abstract
NLP systems for tasks such as question answering and information extraction typically rely on statistical parsers. But the efficacy of such parsers can be surprisingly low, particularly for sentences drawn from heterogeneous corpora such as the Web. We have observed that incorrect parses often result in wildly implausible semantic interpretations of sentences, which can be detected automatically using semantic information obtained from the Web. Based on this observation, we introduce Web-based semantic filtering, a novel, domain-independent method for automatically detecting and discarding incorrect parses. We measure the effectiveness of our filtering system, called WOODWARD, on two test collections. On a set of TREC questions, it reduces the error rate by 67%. On a set of more complex Penn Treebank sentences, it reduces the error rate by 20%. © 2006 Association for Computational Linguistics.
Cite
Yates, A., Schoenmackers, S., & Etzioni, O. (2006). Detecting parser errors using Web-based semantic filters. In COLING/ACL 2006 - EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 27–34). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1610075.1610080