Abstract
Combinatory Categorial Grammar (CCG) is a lexical-ized grammar formalism in which words are associated with categories that, in combination with a small universal set of rules, specify the syntactic configurations in which they may occur. Previous work has shown that learning sequence models for CCG tagging can be improved by using priors that are sensitive to the formal properties of CCG as well as cross-linguistic universal. We extend this approach to the task of learning a full CCG parser from weak supervision. We present a Bayesian formulation for CCG parser induction that assumes only supervision in the form of an incomplete tag dictionary mapping some word types to sets of potential categories. Our approach outperforms a baseline model trained with uniform priors by exploiting universal, intrinsic properties of the CCG formalism to bias the model toward simpler, more cross-linguistically common categories.
Cite
CITATION STYLE
Garrette, D., Dyer, C., Baldridge, J., & Smith, N. A. (2015). Weakly-supervised grammar-informed Bayesian CCG parser learning. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2246–2252). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9516
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.