ε-extension Hidden Markov models and weighted transducers for machine transliteration

0Citations
Citations of this article
75Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We describe in detail a method for transliterating an English string to a foreign language string evaluated on five different languages, including Tamil, Hindi, Russian, Chinese, and Kannada. Our method involves deriving substring alignments from the training data and learning a weighted finite state transducer from these alignments. We define an ǫ-extension Hidden Markov Model to derive alignments between training pairs and a heuristic to extract the substring alignments. Our method involves only two tunable parameters that can be optimized on held-out data.

Cite

CITATION STYLE

APA

Vardarajan, B., & Rao, D. (2009). ε-extension Hidden Markov models and weighted transducers for machine transliteration. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 120–123). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1699705.1699736

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free