Computing translation units and quantifying parallelism in parallel dependency treebanks

7Citations
Citations of this article
77Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The linguistic quality of a parallel treebank depends crucially on the parallelism between the source and target language annotations. We propose a linguistic notion of translation units and a quantitative measure of parallelism for parallel dependency treebanks, and demonstrate how the proposed translation units and parallelism measure can be used to compute transfer rules, spot annotation errors, and compare different annotation schemes with respect to each other. The proposal is evaluated on the 100,000 word Copenhagen Danish-English Dependency Treebank. © 2007 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Buch-Kromann, M. (2007). Computing translation units and quantifying parallelism in parallel dependency treebanks. In ACL 2007: The LAW - Proceedings of The Linguistic Annotation Workshop (pp. 69–76). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1642059.1642071

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free