Dispersions and adjusted frequencies in corpora

Stefan Th. Gries

Journal Article

Dispersions and adjusted frequencies in corpora

Gries S

International Journal of Corpus Linguistics (2008) 13(4) 403-437

DOI: 10.1075/ijcl.13.4.02gri

212Citations

198Readers

Get full text

Abstract

The most frequent statistics in corpus linguistics are frequencies of occurrence and frequencies of co-occurrence of two or more linguistic variables. However, such frequencies in isolation may sometimes be misleading since they do not take into consideration the degree of dispersion of the relevant linguistic variable. Many dispersion measures and adjusted frequency measures have been suggested but are neither widely known nor applied. Another unfortunate aspect of such measures is that many also come with a variety of problems. I pursue three objectives with this article. First, I want to raise awareness of this issue and make the available measures more widely known, so I present an overview of many measures of dispersion and adjusted frequencies. Second, I propose a conceptually simple alternative measure, DP , explain and exemplify it, and compare it to previously discussed measures. Third and most importantly, I urge corpus linguists to explore the notion of dispersion in more detail and outline a few proposals which steps to take next.

Cite

CITATION STYLE

APA

Gries, S. Th. (2008). Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4), 403–437. https://doi.org/10.1075/ijcl.13.4.02gri

Dispersions and adjusted frequencies in corpora

Abstract

Cite

Register to see more suggestions