Token-based typology and word order entropy: A study based on Universal Dependencies

82Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

Abstract

The present paper discusses the benefits and challenges of token-based typology, which takes into account the frequencies of words and constructions in language use. This approach makes it possible to introduce new criteria for language classification, which would be difficult or impossible to achieve with the traditional, type-based approach. This point is illustrated by several quantitative studies of word order variation, which can be measured as entropy at different levels of granularity. I argue that this variation can be explained by general functional mechanisms and pressures, which manifest themselves in language use, such as optimization of processing (including avoidance of ambiguity) and grammaticalization of predictable units occurring in chunks. The case studies are based on multilingual corpora, which have been parsed using the Universal Dependencies annotation scheme.

Cite

CITATION STYLE

APA

Levshina, N. (2019). Token-based typology and word order entropy: A study based on Universal Dependencies. Linguistic Typology, 23(3), 533–572. https://doi.org/10.1515/lingty-2019-0025

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free