Token-based typology and word order entropy: A study based on Universal Dependencies

Natalia Levshina

Journal ArticleOPEN ACCESS

Token-based typology and word order entropy: A study based on Universal Dependencies

Levshina N

Linguistic Typology (2019) 23(3) 533-572

DOI: 10.1515/lingty-2019-0025

82Citations

28Readers

Abstract

The present paper discusses the benefits and challenges of token-based typology, which takes into account the frequencies of words and constructions in language use. This approach makes it possible to introduce new criteria for language classification, which would be difficult or impossible to achieve with the traditional, type-based approach. This point is illustrated by several quantitative studies of word order variation, which can be measured as entropy at different levels of granularity. I argue that this variation can be explained by general functional mechanisms and pressures, which manifest themselves in language use, such as optimization of processing (including avoidance of ambiguity) and grammaticalization of predictable units occurring in chunks. The case studies are based on multilingual corpora, which have been parsed using the Universal Dependencies annotation scheme.

Author supplied keywords

Cite

CITATION STYLE

APA

Levshina, N. (2019). Token-based typology and word order entropy: A study based on Universal Dependencies. Linguistic Typology, 23(3), 533–572. https://doi.org/10.1515/lingty-2019-0025

Token-based typology and word order entropy: A study based on Universal Dependencies

Abstract

Author supplied keywords

Cite

Register to see more suggestions