Multi-align: Combining linguistic and statistical techniques to improve alignments for adaptable MT

Necip Fazil Ayan; Bonnie J. Dorr; Nizar Habash

Journal Article

Multi-align: Combining linguistic and statistical techniques to improve alignments for adaptable MT

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3265 17-26

DOI: 10.1007/978-3-540-30194-3_3

9Citations

39Readers

Get full text

Abstract

An adaptable statistical or hybrid MT system relies heavily on the quality of word-level alignments of real-world data. Statistical alignment approaches provide a reasonable initial estimate for word alignment. However, they cannot handle certain types of linguistic phenomena such as long-distance dependencies and structural differences between languages. We address this issue in Multi-Align, a new framework for incremental testing of different alignment algorithms and their combinations. Our design allows users to tune their systems to the properties of a particular genre/domain while still benefiting from general linguistic knowledge associated with a language pair. We demonstrate that a combination of statistical and linguistically-informed alignments can resolve translation divergences during the alignment process. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Ayan, N. F., Dorr, B. J., & Habash, N. (2004). Multi-align: Combining linguistic and statistical techniques to improve alignments for adaptable MT. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3265, 17–26. https://doi.org/10.1007/978-3-540-30194-3_3

Multi-align: Combining linguistic and statistical techniques to improve alignments for adaptable MT

Abstract

Cite

Register to see more suggestions