Orthographic Syllable as basic unit for SMT between Related Languages

Abstract

We explore the use of the orthographic syllable, a variable-length consonant-vowel sequence, as the basic unit of translation between related languages that use abugida or alphabetic scripts. We show that translation at the orthographic-syllable level significantly outperforms models trained over other basic units (word, morpheme, and character) when training on small parallel corpora.
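To make the idea of a variable-length consonant-vowel unit concrete, here is a minimal sketch of segmenting a Latin-script word into such units. It is an illustration only and not the authors' segmenter: the function name `orthographic_syllables`, the vowel inventory, and the rule "zero or more consonants followed by exactly one vowel, with leftover word-final consonants forming their own unit" are all assumptions made for this example. Abugida scripts would need script-specific handling of vowel signs and other diacritics.

```python
import re

# Assumed vowel inventory for a Latin-script illustration (hypothetical choice).
VOWELS = "aeiouAEIOU"

# A consonant here is any letter that is not in VOWELS.
_CONSONANT = rf"[^{VOWELS}\W\d_]"

# Unit 1: zero or more consonants followed by exactly one vowel.
# Unit 2: a run of consonants with no following vowel (e.g. at the word end).
_UNIT = re.compile(rf"{_CONSONANT}*[{VOWELS}]|{_CONSONANT}+")


def orthographic_syllables(word: str) -> list[str]:
    """Split a word into variable-length consonant-vowel units (sketch)."""
    return _UNIT.findall(word)


if __name__ == "__main__":
    units = orthographic_syllables("translation")
    # A space-separated form like this could be fed to a phrase-based
    # system that treats each unit as a token.
    print(" ".join(units))
```

For a word made only of letters, concatenating the returned units reproduces the original word, so the segmentation is reversible after translation.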

Cite

Kunchukuttan, A., & Bhattacharyya, P. (2016). Orthographic Syllable as basic unit for SMT between Related Languages. In EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 1912–1917). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d16-1196