UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging

Silvio Ricardo Cordeiro; Carlos Ramisch; Aline Villavicencio

Conference ProceedingsOPEN ACCESS

UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging

SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (2016) 910-917

DOI: 10.18653/v1/s16-1140

9Citations

78Readers

Abstract

This paper presents our approach towards the SemEval-2016 Task 10 - Detecting Minimal Semantic Units and their Meanings. Systems are expected to provide a representation of lexical semantics by (1) segmenting tokens into words and multiword units and (2) providing a supersense tag for segments that function as nouns or verbs. Our pipeline rule-based system uses no external resources and was implemented using the mwetoolkit. First, we extract and filter known MWEs from the training corpus. Second, we group input tokens of the test corpus based on this lexicon, with special treatment for non-contiguous expressions. Third, we use an MWE-aware predominant-sense heuristic for supersense tagging. We obtain an F-score of 51.48% for MWE identification and 49.98% for supersense tagging.

Cite

CITATION STYLE

APA

Cordeiro, S. R., Ramisch, C., & Villavicencio, A. (2016). UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging. In SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (pp. 910–917). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s16-1140

UFRGS&LIF at SemEval-2016 task 10: Rule-based MWE identification and predominant-supersense tagging

Abstract

Cite

Register to see more suggestions