Multi-dimensional annotation and alignment in an English-German translation corpus

25Citations
Citations of this article
76Readers
Mendeley users who have this article in their library.

Abstract

This paper presents the compilation of the CroCo Corpus, an English-German translation corpus. Corpus design, annotation and alignment are described in detail. In order to guarantee the searchability and exchangeability of the corpus, XML stand-off mark-up is used as representation format for the multi-layer annotation. On this basis it is shown how the corpus can be queried using XQuery. Furthermore, the generalisation of results in terms of linguistic and translational research questions is briefly discussed.

Cite

CITATION STYLE

APA

Hansen-Schirra, S., Neumann, S., & Vela, M. (2006). Multi-dimensional annotation and alignment in an English-German translation corpus. In Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing, NLPXML 2006 at the 11th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2006 (pp. 35–42). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1621034.1621040

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free