Joint morphological-lexical language modeling for machine translation

Ruhi Sarikaya; Yonggang Deng

Conference Proceedings

Joint morphological-lexical language modeling for machine translation

NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers (2007) 145-148

DOI: 10.3115/1614108.1614145

21Citations

86Readers

Get full text

Abstract

We present a joint morphological-lexical language model (JMLLM) for use in statistical machine translation (SMT) of language pairs where one or both of the languages are morphologically rich. The proposed JMLLM takes advantage of the rich morphology to reduce the Out-Of-Vocabulary (OOV) rate, while keeping the predictive power of the whole words. It also allows incorporation of additional available semantic, syntactic and linguistic information about the morphemes and words into the language model. Preliminary experiments with an English to Dialectal-Arabic SMT system demonstrate improved translation performance over trigram based baseline language model.

Cite

CITATION STYLE

APA

Sarikaya, R., & Deng, Y. (2007). Joint morphological-lexical language modeling for machine translation. In NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers (pp. 145–148). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614108.1614145

Joint morphological-lexical language modeling for machine translation

Abstract

Cite

Register to see more suggestions