An adaptable statistical or hybrid MT system relies heavily on the quality of word-level alignments of real-world data. Statistical alignment approaches provide a reasonable initial estimate for word alignment. However, they cannot handle certain types of linguistic phenomena such as long-distance dependencies and structural differences between languages. We address this issue in Multi-Align, a new framework for incremental testing of different alignment algorithms and their combinations. Our design allows users to tune their systems to the properties of a particular genre/domain while still benefiting from general linguistic knowledge associated with a language pair. We demonstrate that a combination of statistical and linguistically-informed alignments can resolve translation divergences during the alignment process. © Springer-Verlag 2004.
CITATION STYLE
Ayan, N. F., Dorr, B. J., & Habash, N. (2004). Multi-align: Combining linguistic and statistical techniques to improve alignments for adaptable MT. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3265, 17–26. https://doi.org/10.1007/978-3-540-30194-3_3
Mendeley helps you to discover research relevant for your work.