Adaptive bilingual sentence alignment

Thomas C. Chuang; G. N. You; Jason S. Chang

Conference Proceedings

Adaptive bilingual sentence alignment

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2499 21-30

DOI: 10.1007/3-540-45820-4_3

10Citations

32Readers

Get full text

Abstract

We present a new approach to the problem of aligning English and Chinese sentences in a bilingual corpus based on adaptive learning. While using length information alone produces surprisingly good results for aligning bilingual French and English sentences with success rates well over 95%, it does not fair as well for the alignment of English and Chinese sentences. The crux of the problem lies in greater variability of lengths and match types of the matched sentences. We propose to cope with such variability via a two-pass scheme under which model parameters can be learned from the data at hand. Experiments show that under the approach bilingual English-Chinese texts can be aligned effectively across diverse domains, genres and translation directions with accuracy rates approaching 99%.

Cite

CITATION STYLE

APA

Chuang, T. C., You, G. N., & Chang, J. S. (2002). Adaptive bilingual sentence alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2499, pp. 21–30). Springer Verlag. https://doi.org/10.1007/3-540-45820-4_3

Adaptive bilingual sentence alignment

Abstract

Cite

Register to see more suggestions