Measuring language complexity using word embeddings

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The analysis of word patterns from a corpus has previously been examined using a number of different word embedding models. These models create a numeric representation of word co-occurrence and are able to capture some of the syntactic and semantic relationships of words in a document. Assessing language complexity has been considered for many years through the use of simple indexes and basic statistical properties (word frequency, etc.), however little work has been done on using word embeddings to develop language complexity measures. This paper describes preliminary work on measuring language complexity using clustered word embeddings to produce network transition models. The structural measures of these transition networks are shown to represent basic properties of language complexity and may be used to infer some aspects of the underlying generative grammar.

Cite

CITATION STYLE

APA

Whigham, P. A., Chugh, M., & Dick, G. (2018). Measuring language complexity using word embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11320 LNAI, pp. 843–854). Springer Verlag. https://doi.org/10.1007/978-3-030-03991-2_76

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free