Neural lemmatization of multiword expressions

5Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.

Abstract

This article focuses on the lemmatization of multiword expressions (MWEs). We propose a deep encoder-decoder architecture generating for every MWE word its corresponding part in the lemma, based on the internal context of the MWE. The encoder relies on recurrent networks based on (1) the character sequence of the individual words to capture their morphological properties, and (2) the word sequence of the MWE to capture lexical and syntactic properties. The decoder in charge of generating the corresponding part of the lemma for each word of the MWE is based on a classical character-level attention-based recurrent model. Our model is evaluated for Italian, French, Polish and Portuguese and shows good performances except for Polish.

Cite

CITATION STYLE

APA

Schmitt, M., & Constant, M. (2019). Neural lemmatization of multiword expressions. In ACL 2019 - Joint Workshop on Multiword Expressions and WordNet, MWE-WN 2019 - Proceedings of the Workshop (pp. 142–148). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5117

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free