A structurally diverse minimal corpus for eliciting structural mappings between languages

0Citations
Citations of this article
40Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We describe an approach to creating a small but diverse corpus in English that can be used to elicit information about any target language. The focus of the corpus is on structural information. The resulting bilingual corpus can then be used for natural language processing tasks such as inferring transfer mappings for Machine Translation. The corpus is sufficiently small that a bilingual user can translate and wordalign it within a matter of hours. We describe how the corpus is created and how its structural diversity is ensured. We then argue that it is not necessary to introduce a large amount of redundancy into the corpus. This is shown by creating an increasingly redundant corpus and observing that the information gained converges as redundancy increases. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Probst, K., & Lavie, A. (2004). A structurally diverse minimal corpus for eliciting structural mappings between languages. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3265, 217–226. https://doi.org/10.1007/978-3-540-30194-3_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free