Aligning Turkish and English parallel texts for statistical machine translation

Ilknur D. El-Kahlout; Kemal Oflazer

Conference Proceedings

Aligning Turkish and English parallel texts for statistical machine translation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3733 LNCS 616-625

DOI: 10.1007/11569596_64

0Citations

4Readers

Get full text

Abstract

This paper presents a preliminary work on aligning Turkish and English parallel texts towards developing a statistical machine translation system for English and Turkish. To avoid the data sparseness problem and to uncover relations between sublexical components of words such as morphemes, we have converted our parallel texts to a morphemic representation and then used standard word alignment algorithms. Results from a mere 3K sentences of parallel English-Turkish texts show that we are able to link Turkish morphemes with English morphemes and function words quite successfully. We have also used the Turkish WordNet which is linked with the English WordNet, as a bootstrapping dictionary to constrain root word alignments. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

El-Kahlout, I. D., & Oflazer, K. (2005). Aligning Turkish and English parallel texts for statistical machine translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3733 LNCS, pp. 616–625). https://doi.org/10.1007/11569596_64

Aligning Turkish and English parallel texts for statistical machine translation

Abstract

Cite

Register to see more suggestions