Abstract
Formulaic language is widely acknowledged to be a central part of a language. However, it is heterogeneous in nature, made up of various formulaic categories with their own characteristics and behaviour. A first step towards systematically describing the relationship between these categories is to describe their distribution in language. This study investigated the frequency of occurrence of four categories of formulaic sequences: collocations, phrasal verbs, idiomatic phrases, and lexical bundles. Together the four categories made up about 41% of English, with lexical bundles being by far the most common, followed by collocations, idiomatic phrases and phrasal verbs. There were differences in the frequencies of each category in the overall corpus, and also in the four registers analysed (academic prose, fiction, newspaper language, and spoken conversation). Language mode (spoken/written) had a substantial effect on the frequency distribution of the categories as well.
Cite
CITATION STYLE
Vilkaitė, L. (2016). Formulaic language is not all the same: comparing the frequency of idiomatic phrases, collocations, lexical bundles, and phrasal verbs. Taikomoji Kalbotyra, (8), 28–54. https://doi.org/10.15388/tk.2016.17505
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.