Abstract
The linguistic quality of a parallel treebank depends crucially on the parallelism between the source and target language annotations. We propose a linguistic notion of translation units and a quantitative measure of parallelism for parallel dependency treebanks, and demonstrate how the proposed translation units and parallelism measure can be used to compute transfer rules, spot annotation errors, and compare different annotation schemes with respect to each other. The proposal is evaluated on the 100,000 word Copenhagen Danish-English Dependency Treebank. © 2007 Association for Computational Linguistics.
Cite
CITATION STYLE
Buch-Kromann, M. (2007). Computing translation units and quantifying parallelism in parallel dependency treebanks. In ACL 2007: The LAW - Proceedings of The Linguistic Annotation Workshop (pp. 69–76). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1642059.1642071
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.