This paper presents the compilation of the CroCo Corpus, an English-German translation corpus. Corpus design, annotation and alignment are described in detail. In order to guarantee the searchability and exchangeability of the corpus, XML stand-off mark-up is used as representation format for the multi-layer annotation. On this basis it is shown how the corpus can be queried using XQuery. Furthermore, the generalisation of results in terms of linguistic and translational research questions is briefly discussed.
CITATION STYLE
Hansen-Schirra, S., Neumann, S., & Vela, M. (2006). Multi-dimensional annotation and alignment in an English-German translation corpus. In Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing, NLPXML 2006 at the 11th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2006 (pp. 35–42). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1621034.1621040
Mendeley helps you to discover research relevant for your work.