POS induction with distributional and morphological information using a distance-dependent Chinese Restaurant Process

7Citations
Citations of this article
103Readers
Mendeley users who have this article in their library.

Abstract

We present a new approach to inducing the syntactic categories of words, combining their distributional and morphological properties in a joint nonparametric Bayesian model based on the distance-dependent Chinese Restaurant Process. The prior distribution over word clusterings uses a log-linear model of morphological similarity; the likelihood function is the probability of generating vector word embeddings. The weights of the morphology model are learned jointly while inducing part-of-speech clusters, encouraging them to cohere with the distributional features. The resulting algorithm outperforms competitive alternatives on English POS induction. © 2014 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Sirts, K., Eisenstein, J., Elsner, M., & Goldwater, S. (2014). POS induction with distributional and morphological information using a distance-dependent Chinese Restaurant Process. In 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference (Vol. 2, pp. 265–271). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p14-2044

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free