Exploring long-term temporal trends in the use of Multiword Expressions

2Citations
Citations of this article
69Readers
Mendeley users who have this article in their library.

Abstract

Differentiating between outdated expressions and current expressions is not a trivial task for foreign language learners, and could be beneficial for lexicographers, as they examine expressions. Assuming that the usage of expressions over time can be represented by a time-series of their periodic frequencies over a large lexicographic corpus, we test the hypothesis that there exists an old-new relationship between the time-series of some synonymous expressions, a hint that a later expression has replaced an earlier one. Another hypothesis we test is that Multiword Expressions (MWEs) can be characterized by sparsity & frequency thresholds. Using a dataset of 1 million English books, we choose MWEs having the most positive or the most negative usage trends from a ready-made list of known MWEs. We identify synonyms of those expressions in a historical thesaurus and visualize the temporal relationships between the resulting expression pairs. Our empirical results indicate that old-new usage relationships do exist between some synonymous expressions, and that new candidate expressions, not found in dictionaries, can be found by analyzing usage trends.

Cite

CITATION STYLE

APA

Daniel, T., & Last, M. (2016). Exploring long-term temporal trends in the use of Multiword Expressions. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 11–20). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-1802

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free