We introduce a provably correct learning algorithm for latent-variable PCFGs. The algorithm relies on two steps: first, the use of a matrix-decomposition algorithm applied to a co-occurrence matrix estimated from the parse trees in a training sample; second, the use of EM applied to a convex objective derived from the training samples in combination with the output from the matrix decomposition. Experiments on parsing and a language modeling problem show that the algorithm is efficient and effective in practice. © 2014 Association for Computational Linguistics.
CITATION STYLE
Cohen, S. B., & Collins, M. (2014). A provably correct learning algorithm for latent-variable PCFGs. In 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference (Vol. 1, pp. 1052–1061). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p14-1099
Mendeley helps you to discover research relevant for your work.