Phoneme strings based machine transliteration

Ying Qin

Conference Proceedings

Phoneme strings based machine transliteration

Qin Y

NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering (2011) 304-309

DOI: 10.1109/NLPKE.2011.6138214

1Citations

3Readers

Get full text

Abstract

Transliteration is always used to translate source names with approximate equivalence of pronunciation into target language. Current direct orthographical mapping (DOM) approach does segmentation and alignment on the basis of the single syllable. However it is hard to break down English names into constituent parts according to their corresponding single Chinese characters. This document proposes an approach of segmentation and alignment on the unit of phoneme strings in transliteration between English and Chinese. To lessen the calculation of model training on whole corpus, we split the training data into several pools stochastically and each is used to train a model. The final results of transliteration are arranged according to the decoding probability of each model, called combined model. The combined machine transliteration system between English and Chinese performs remarkably well on the shared task of NEWS2011. © 2011 IEEE.

Author supplied keywords

Cite

CITATION STYLE

APA

Qin, Y. (2011). Phoneme strings based machine transliteration. In NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering (pp. 304–309). https://doi.org/10.1109/NLPKE.2011.6138214

Phoneme strings based machine transliteration

Abstract

Author supplied keywords

Cite

Register to see more suggestions