This paper investigates training methods used in a statistical machine translation (SMT) system from English to Malayalam. In English-Malayalam SMT, word-to-word translations are determined by training on a parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments across all sentence pairs in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of a part-of-speech tagger yields better training results and improved accuracy. © 2010 Springer-Verlag Berlin Heidelberg.
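To make the idea of reducing the alignment space concrete, the following is a minimal sketch and not the authors' implementation. It assumes an IBM-style word alignment in which each word on one side links to exactly one word on the other side or to an empty (NULL) word, and it assumes that a part-of-speech constraint only permits links between words with the same tag. The tag sets, the example sentence lengths, and the function names are all hypothetical.

```python
from math import prod


def count_alignments(src_len: int, tgt_len: int) -> int:
    """Unconstrained count: each target word may link to any of the
    src_len source words or to NULL (assumed IBM-style alignment)."""
    return (src_len + 1) ** tgt_len


def count_alignments_with_tags(src_tags: list[str], tgt_tags: list[str]) -> int:
    """Constrained count: a target word may link only to source words
    carrying the same tag, or to NULL (hypothetical matching rule)."""
    return prod(
        1 + sum(1 for s in src_tags if s == t)  # NULL + same-tag source words
        for t in tgt_tags
    )


# Hypothetical tag sequences for one sentence pair. English has a
# determiner that has no counterpart on the three-word Malayalam side.
src_tags = ["DET", "NOUN", "VERB", "NOUN"]
tgt_tags = ["NOUN", "NOUN", "VERB"]

unconstrained = count_alignments(len(src_tags), len(tgt_tags))
constrained = count_alignments_with_tags(src_tags, tgt_tags)

print(unconstrained, constrained)
```

Under these assumptions the sentence pair has 5^3 = 125 candidate alignments without tags and 3 × 3 × 2 = 18 with them, which illustrates why tagging the corpus can shrink the space the training procedure has to consider.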
Sebastian, M. P., Sheena Kurian, K., & Santhosh Kumar, G. (2010). Alignment model and training technique in SMT from English to Malayalam. In Communications in Computer and Information Science (Vol. 94 CCIS, pp. 305–315). https://doi.org/10.1007/978-3-642-14834-7_29