Abstract
This paper describes a log-linear model with an n-gram reference distribution for accurate probabilistic HPSG parsing. In the model, the n-gram reference distribution is simply defined as the product of the probabilities of selecting lexical entries, which are provided by the discriminative method with machine learning features of word and POS n-gram as defined in the CCG/HPSG/CDG supertagging. Recently, supertagging becomes well known to drastically improve the parsing accuracy and speed, but supertagging techniques were heuristically introduced, and hence the probabilistic models for parse trees were not well defined. We introduce the supertagging probabilities as a reference distribution for the log-linear model of the probabilistic HPSG. This is the first model which properly incorporates the supertagging probabilities into parse tree’s probabilistic model.
Cite
CITATION STYLE
Ninomiya, T., Matsuzaki, T., Miyao, Y., & Tsujii, J. (2007). A log-linear model with an n-gram reference distribution for accurate HPSG parsing. In IWPT 2007 - Proceedings of the 10th International Conference on Parsing Technologies (pp. 60–68). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1621410.1621418
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.