We present a new approach to inducing the syntactic categories of words, combining their distributional and morphological properties in a joint nonparametric Bayesian model based on the distance-dependent Chinese Restaurant Process. The prior distribution over word clusterings uses a log-linear model of morphological similarity; the likelihood function is the probability of generating vector word embeddings. The weights of the morphology model are learned jointly while inducing part-of-speech clusters, encouraging them to cohere with the distributional features. The resulting algorithm outperforms competitive alternatives on English POS induction. © 2014 Association for Computational Linguistics.
CITATION STYLE
Sirts, K., Eisenstein, J., Elsner, M., & Goldwater, S. (2014). POS induction with distributional and morphological information using a distance-dependent Chinese Restaurant Process. In 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference (Vol. 2, pp. 265–271). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p14-2044
Mendeley helps you to discover research relevant for your work.