Abstract
We describe in detail a method for transliterating an English string to a foreign language string evaluated on five different languages, including Tamil, Hindi, Russian, Chinese, and Kannada. Our method involves deriving substring alignments from the training data and learning a weighted finite state transducer from these alignments. We define an ǫ-extension Hidden Markov Model to derive alignments between training pairs and a heuristic to extract the substring alignments. Our method involves only two tunable parameters that can be optimized on held-out data.
Cite
CITATION STYLE
Vardarajan, B., & Rao, D. (2009). ε-extension Hidden Markov models and weighted transducers for machine transliteration. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 120–123). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1699705.1699736
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.