Statistical comparability: Methodological caveats

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The notion of comparable corpora implies the notion of comparability. The present paper aims at explicating this notion with respect to statistical methods because statistical comparison requires the use of statistical tests, which again require certain properties of the data under analysis. Linguistic data, however, do not automatically meet these requirements. In corpus linguistics and other linguistic fields, statistical methods are often applied without any previous check of their applicability. The paper will give some warnings and show some examples of corresponding test procedures. A number of other frequently used terms and concepts, such as representativeness, homogeneity, and balanced corpora, play a central role in corpus-linguistic argumentations and will be analysed in the paper, too, as they concern compilation and use of comparable corpora.

Cite

CITATION STYLE

APA

Köhler, R. (2013). Statistical comparability: Methodological caveats. In Building and Using Comparable Corpora (pp. 77–91). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-20128-8_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free