Multi-LEX: A database of multi-word frequencies for French and English

Marjorie Armando; Jonathan Grainger; Stephane Dufau

Journal Article (Open Access)

Behavior Research Methods (2023) 55(8) 4315-4328

DOI: 10.3758/s13428-022-02018-9

Abstract

Written word frequency is a key variable used in many psycholinguistic studies and is central in explaining visual word recognition. Indeed, methodological advances on single-word frequency estimates have helped to uncover novel language-related cognitive processes, fostering new ideas and studies. In an attempt to support and promote research on a related emerging topic, visual multi-word recognition, we extracted from the exhaustive Google Ngram datasets a selection of millions of multi-word sequences and computed their associated frequency estimate. Such sequences are presented with part-of-speech information for each individual word. An online behavioral investigation making use of the French 4-gram lexicon in a grammatical decision task was carried out. The results show an item-level frequency effect of word sequences. Moreover, the proposed datasets were found useful during the stimulus selection phase, allowing more precise control of the multi-word characteristics.

Cite

Armando, M., Grainger, J., & Dufau, S. (2023). Multi-LEX: A database of multi-word frequencies for French and English. Behavior Research Methods, 55(8), 4315–4328. https://doi.org/10.3758/s13428-022-02018-9
