Abstract
We present a joint morphological-lexical language model (JMLLM) for use in statistical machine translation (SMT) of language pairs where one or both of the languages are morphologically rich. The proposed JMLLM takes advantage of the rich morphology to reduce the Out-Of-Vocabulary (OOV) rate, while keeping the predictive power of the whole words. It also allows incorporation of additional available semantic, syntactic and linguistic information about the morphemes and words into the language model. Preliminary experiments with an English to Dialectal-Arabic SMT system demonstrate improved translation performance over trigram based baseline language model.
Cite
CITATION STYLE
Sarikaya, R., & Deng, Y. (2007). Joint morphological-lexical language modeling for machine translation. In NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers (pp. 145–148). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614108.1614145
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.