Head-driven PCFGs with latent-head statistics

17Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Although state-of-the-art parsers for natural language are lexicalized, it was recently shown that an accurate unlexicalized parser for the Penn tree-bank can be simply read off a manually refined treebank. While lexicalized parsers often suffer from sparse data, manual mark-up is costly and largely based on individual linguistic intuition. Thus, across domains, languages, and tree-bank annotations, a fundamental question arises: Is it possible to automatically induce an accurate parser from a tree-bank without resorting to full lexicalization? In this paper, we show how to induce head-driven probabilistic parsers with latent heads from a tree-bank. Our automatically trained parser has a performance of 85.7% (LP/LR F1), which is already better than that of early lexicalized ones.

Cite

CITATION STYLE

APA

Prescher, D. (2005). Head-driven PCFGs with latent-head statistics. In IWPT 2005 - Proceedings of the 9th International Workshop on Parsing Technologies (pp. 115–124). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1654494.1654506

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free