We propose a new hierarchical Bayesian n-gram model of natural language. Our model uses Pitman-Yor processes, a generalization of the commonly used Dirichlet distributions, which produce power-law distributions that more closely resemble those found in natural language. We show that an approximation to the hierarchical Pitman-Yor language model recovers the exact formulation of interpolated Kneser-Ney, one of the best smoothing methods for n-gram language models. Experiments verify that our model gives cross-entropy results superior to interpolated Kneser-Ney and comparable to modified Kneser-Ney. © 2006 Association for Computational Linguistics.
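The power-law behavior mentioned above comes from the Pitman-Yor process's "Chinese restaurant" seating scheme: with a nonzero discount parameter, the distribution over table sizes has a heavy tail, unlike the Dirichlet process. Below is a minimal illustrative sketch of that seating scheme (not the paper's hierarchical model or its inference procedure); the function name and parameter defaults are assumptions for illustration.

```python
import random

def pitman_yor_crp(n, d=0.5, theta=1.0, seed=0):
    """Seat n customers via the Pitman-Yor Chinese restaurant process
    with discount d (0 <= d < 1) and strength theta > -d.
    Returns the list of table sizes. For d > 0 the table sizes follow
    a power law; d = 0 recovers the Dirichlet process."""
    rng = random.Random(seed)
    tables = []  # number of customers seated at each table
    for i in range(n):  # i equals the number already seated
        # A new table opens with probability (theta + d*#tables)/(theta + i);
        # the first customer always opens one.
        if i == 0 or rng.random() < (theta + d * len(tables)) / (theta + i):
            tables.append(1)
        else:
            # Join existing table k with probability proportional to (c_k - d);
            # the discount d shaves mass off occupied tables, creating the tail.
            r = rng.random() * (i - d * len(tables))
            acc = 0.0
            for k, c in enumerate(tables):
                acc += c - d
                if r < acc:
                    tables[k] += 1
                    break
    return tables
```

In the hierarchical model, the same discount mechanism is what yields the absolute-discounting form of interpolated Kneser-Ney when the approximation described in the paper is applied.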
CITATION STYLE
Teh, Y. W. (2006). A hierarchical Bayesian language model based on Pitman-Yor processes. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 985–992). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220175.1220299