ε-extension Hidden Markov models and weighted transducers for machine transliteration

Balakrishnan Vardarajan; Delip Rao

Conference Proceedings

ε-extension Hidden Markov models and weighted transducers for machine transliteration

NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (2009) 120-123

DOI: 10.3115/1699705.1699736

0Citations

75Readers

Get full text

Abstract

We describe in detail a method for transliterating an English string to a foreign language string evaluated on five different languages, including Tamil, Hindi, Russian, Chinese, and Kannada. Our method involves deriving substring alignments from the training data and learning a weighted finite state transducer from these alignments. We define an ǫ-extension Hidden Markov Model to derive alignments between training pairs and a heuristic to extract the substring alignments. Our method involves only two tunable parameters that can be optimized on held-out data.

Cite

CITATION STYLE

APA

Vardarajan, B., & Rao, D. (2009). ε-extension Hidden Markov models and weighted transducers for machine transliteration. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 120–123). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1699705.1699736

ε-extension Hidden Markov models and weighted transducers for machine transliteration

Abstract

Cite

Register to see more suggestions