Recognition of multiword expressions using word embeddings

Abstract

In this paper we consider the task of extracting multiword expressions (MWEs) for the Russian thesaurus RuThes, which contains various types of phrases, including non-compositional phrases, multiword terms and their variants, light verb constructions, and others. We study several embedding-based features for phrases and their components and estimate their contribution to finding multiword expressions of different types, comparing them with traditional association and context measures. We found that one of the distributional features gives relatively high MWE extraction quality even when used alone. Combining it in different ways with other features (phrase frequency, association measures) improves on both initial orderings. We also demonstrate the significant potential of an existing thesaurus for recognizing new multiword expressions to add to that thesaurus.
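
The abstract does not say which embedding-based feature or which combination scheme the authors use, so the following is only a minimal sketch of the general idea under stated assumptions. It scores a phrase by the cosine similarity between a phrase vector and the sum of its component vectors (one commonly used signal, not necessarily the paper's best feature), computes pointwise mutual information as an example association measure, and merges the two orderings by summing rank positions. All function names, vectors, and counts here are hypothetical and made up for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def compositionality(phrase_vec, component_vecs):
    """Cosine between a phrase vector and the sum of its component vectors."""
    summed = [sum(dims) for dims in zip(*component_vecs)]
    return cosine(phrase_vec, summed)


def pmi(pair_count, count_x, count_y, total):
    """Pointwise mutual information (base 2) from raw corpus counts."""
    return math.log2((pair_count * total) / (count_x * count_y))


def rank_positions(scores, reverse=True):
    """Map each item to its 1-based position in the sorted order."""
    ordered = sorted(scores, key=scores.get, reverse=reverse)
    return {item: i + 1 for i, item in enumerate(ordered)}


def combine_by_rank(*rankings):
    """Order items by the sum of their positions across several rankings."""
    return sorted(rankings[0], key=lambda item: sum(r[item] for r in rankings))


# Toy, made-up data purely for illustration (not taken from the paper).
phrase_vecs = {"hot dog": [0.1, 0.9], "red car": [0.7, 0.7]}
component_vecs = {
    "hot dog": [[0.9, 0.1], [0.8, 0.3]],
    "red car": [[0.6, 0.2], [0.2, 0.6]],
}
# (pair count, first word count, second word count, corpus size)
pair_stats = {
    "hot dog": (30, 200, 150, 100000),
    "red car": (12, 400, 900, 100000),
}

similarity = {p: compositionality(phrase_vecs[p], component_vecs[p]) for p in phrase_vecs}
association = {p: pmi(*pair_stats[p]) for p in pair_stats}

# Assumption: low similarity to the component sum hints at non-compositionality,
# so the embedding feature is ranked in ascending order.
by_embedding = rank_positions(similarity, reverse=False)
by_pmi = rank_positions(association, reverse=True)
candidates = combine_by_rank(by_embedding, by_pmi)
print(candidates)
```

Summing rank positions is just one simple way to merge two orderings; weighted sums of normalized scores or a trained classifier over the features would be equally plausible choices.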

Cite

Loukachevitch, N., & Parkhomenko, E. (2018). Recognition of multiword expressions using word embeddings. In Communications in Computer and Information Science (Vol. 934, pp. 112–124). Springer Verlag. https://doi.org/10.1007/978-3-030-00617-4_11
