Estimating linguistic diversity on the internet: A taxonomy to avoid pitfalls and paradoxes

Peter Gerrand

Journal Article, Open Access

Journal of Computer-Mediated Communication (2007) 12(4) 1298-1321

DOI: 10.1111/j.1083-6101.2007.00374.x

Abstract

Both UNESCO and OECD have recognized the public policy benefit of publicizing information on linguistic diversity on the Internet. However, the published methodologies for estimating "linguistic diversity" or "Internet statistics (by language)" do so with different interpretations of these key terms. This article creates a new taxonomy, defining and contrasting user activity, user profile, web presence, and diversity index to distinguish among the various indicators used to estimate language usage on the Internet. This taxonomy facilitates comparisons of the available methodologies, whose limitations are then critiqued. It also helps to resolve the apparent paradox as to whether the use of English on the Internet has declined rapidly or has remained fairly stable. The study concludes that the best estimates of web presence can be achieved by direct measurement: randomly addressing and analyzing a representative sample of all public websites. However, this approach will only suffice if the language detection software used is progressively extended to recognize all the world's written languages. © 2007 International Communication Association.
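The direct-measurement approach named in the abstract (sample public websites at random, detect each site's language, then summarize) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the article: the function names `language_shares` and `diversity_index` are hypothetical, the sample of detected language codes is invented, and the Simpson-style index (one minus the sum of squared shares) is only one common way to define a diversity index and may differ from the definition the article uses.

```python
from collections import Counter


def language_shares(detected):
    """Return each language's share of the sampled sites.

    `detected` is a list of language codes, one per sampled website,
    as produced by some language detection step (not modeled here).
    """
    counts = Counter(detected)
    total = sum(counts.values())
    return {lang: n / total for lang, n in counts.items()}


def diversity_index(shares):
    """Simpson-style diversity index: 1 - sum of squared shares.

    Assumed definition for illustration. The value is 0 when one language
    accounts for every site and rises as shares become more evenly spread.
    """
    return 1.0 - sum(p * p for p in shares.values())


# Hypothetical detector output for a random sample of ten public websites.
sample = ["en", "en", "en", "en", "en", "es", "es", "de", "ca", "ja"]

shares = language_shares(sample)
index = diversity_index(shares)
print(shares)
print(round(index, 2))
```

Under these assumptions, the shares play the role of a web presence estimate per language, and the index condenses them into a single figure. A language that the detector cannot recognize never appears in `detected`, which is the limitation the abstract raises about detection software.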

Cite

Gerrand, P. (2007). Estimating linguistic diversity on the internet: A taxonomy to avoid pitfalls and paradoxes. Journal of Computer-Mediated Communication, 12(4), 1298–1321. https://doi.org/10.1111/j.1083-6101.2007.00374.x
