Unsupervised word embeddings have been shown to be valuable as features in supervised learning problems; however, their role in unsupervised problems has been less thoroughly explored. In this paper, we show that embeddings can likewise add value to the problem of unsupervised POS induction. In two representative models of POS induction, we replace multinomial distributions over the vocabulary with multivariate Gaussian distributions over word embeddings and observe consistent improvements in eight languages. We also analyze the effect of various choices while inducing word embeddings on "downstream" POS induction results.
CITATION STYLE
Lin, C. C., Ammar, W., Dyer, C., & Levin, L. (2015). Unsupervised POS induction with word embeddings. In NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 1311–1316). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/n15-1144
Mendeley helps you to discover research relevant for your work.