Fitting ranked linguistic data with two-parameter functions

Abstract

It is well known that many ranked linguistic data sets can be fit well by one-parameter models, such as Zipf's law for ranked word frequencies. However, where discrepancies from the one-parameter model occur (these occur at the two extremes of the rank range), it is natural to add one more parameter to the fitting model. In this paper, we compare several two-parameter models, including the Beta function, the Yule function, and the Weibull function (all of which can be framed as a multiple regression in the logarithmic scale), in their performance in fitting several ranked linguistic data sets, such as letter frequencies, word-spacings, and word frequencies. We observed that the Beta function fits the ranked letter frequencies best, the Yule function fits the ranked word-spacing distribution best, and the Altmann, Beta, and Yule functions all slightly outperform Zipf's power-law function for the ranked word-frequency distribution. © 2010.
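
To illustrate what "multiple regression in the logarithmic scale" can look like in practice, here is a minimal sketch for one of the models. It assumes the Beta rank function takes the form f(r) = C (N + 1 - r)^b / r^a, which is not spelled out in the abstract, so that log f is linear in log r and log(N + 1 - r). The function name `fit_beta_rank` and the synthetic data are hypothetical and chosen only for this example; the paper's own fitting procedure may differ.

```python
import numpy as np

def fit_beta_rank(freqs):
    """Fit log f = log C - a*log(r) + b*log(N + 1 - r) by ordinary least squares.

    Assumed model form (not taken from the abstract):
        f(r) = C * (N + 1 - r)**b / r**a
    All frequencies must be strictly positive.
    """
    f = np.sort(np.asarray(freqs, dtype=float))[::-1]  # rank 1 = largest value
    n = f.size
    r = np.arange(1, n + 1)
    # Design matrix: intercept, log r, log(N + 1 - r)
    X = np.column_stack([np.ones(n), np.log(r), np.log(n + 1 - r)])
    coef, *_ = np.linalg.lstsq(X, np.log(f), rcond=None)
    log_c, minus_a, b = coef
    return np.exp(log_c), -minus_a, b

# Synthetic, noise-free data generated from the assumed model (illustration only)
N = 26
ranks = np.arange(1, N + 1)
true_C, true_a, true_b = 0.1, 0.3, 0.8
synthetic = true_C * (N + 1 - ranks) ** true_b / ranks ** true_a

C_hat, a_hat, b_hat = fit_beta_rank(synthetic)
print(C_hat, a_hat, b_hat)
```

Setting b = 0 in the assumed form leaves a pure power law in r, which is one way to see the second parameter as a correction to Zipf's law at the low-frequency end of the ranking.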

Li, W., Miramontes, P., & Cocho, G. (2010). Fitting ranked linguistic data with two-parameter functions. Entropy, 12(7), 1743–1764. https://doi.org/10.3390/e12071743
