This paper presents English-Hindi transliteration in the NEWS 2009 Machine Transliteration Shared Task adding source context modeling into state-of-the-art log-linear phrase-based statistical machine translation (PB-SMT). Source context features enable us to exploit source similarity in addition to target similarity, as modelled by the language model. We use a memory-based classification framework that enables efficient estimation of these features while avoiding data sparseness problems.We carried out experiments both at character and transliteration unit (TU) level. Position-dependent source context features produce significant improvements in terms of all evaluation metrics.
CITATION STYLE
Haque, R., Dandapat, S., Srivastava, A. K., Naskar, S. K., & Way, A. (2009). English-Hindi transliteration using context-informed PB-SMT: The DCU system for NEWS 2009. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 104–107). Association for Computational Linguistics (ACL).
Mendeley helps you to discover research relevant for your work.