Influence of treebank design on representation of multiword expressions

Abstract

Multiword Expressions (MWEs) are important linguistic units that require special treatment in many NLP applications. It is thus desirable to be able to recognize them automatically. Semantically annotated corpora should mark MWEs in a clear way that facilitates development of automatic recognition tools. In the present paper we discuss various corpus design decisions from this perspective. We propose guidelines that should lead to MWE-friendly annotation and evaluate them on numerous sentence examples. Our experience of identifying MWEs in the Prague Dependency Treebank provides the basis for the discussion, and examples from other languages are added whenever appropriate. © 2011 Springer-Verlag.

Bejček, E., Straňák, P., & Zeman, D. (2011). Influence of treebank design on representation of multiword expressions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6608 LNCS, pp. 1–14). https://doi.org/10.1007/978-3-642-19400-9_1
