UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging

9Citations
Citations of this article
78Readers
Mendeley users who have this article in their library.

Abstract

This paper presents our approach towards the SemEval-2016 Task 10 - Detecting Minimal Semantic Units and their Meanings. Systems are expected to provide a representation of lexical semantics by (1) segmenting tokens into words and multiword units and (2) providing a supersense tag for segments that function as nouns or verbs. Our pipeline rule-based system uses no external resources and was implemented using the mwetoolkit. First, we extract and filter known MWEs from the training corpus. Second, we group input tokens of the test corpus based on this lexicon, with special treatment for non-contiguous expressions. Third, we use an MWE-aware predominant-sense heuristic for supersense tagging. We obtain an F-score of 51.48% for MWE identification and 49.98% for supersense tagging.

Cite

CITATION STYLE

APA

Cordeiro, S. R., Ramisch, C., & Villavicencio, A. (2016). UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging. In SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (pp. 910–917). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s16-1140

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free