English-Hindi transliteration using context-informed PB-SMT: The DCU system for NEWS 2009

Rejwanul Haque; Sandipan Dandapat; Ankit Kumar Srivastava; Sudip Kumar Naskar; Andy Way

Conference Proceedings (Open Access)

NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (2009) 104-107

Abstract

This paper presents the DCU English-Hindi transliteration system for the NEWS 2009 Machine Transliteration Shared Task, which adds source context modeling to a state-of-the-art log-linear phrase-based statistical machine translation (PB-SMT) system. Source context features enable us to exploit source similarity in addition to the target similarity modelled by the language model. We use a memory-based classification framework that enables efficient estimation of these features while avoiding data sparseness problems. We carried out experiments at both the character level and the transliteration unit (TU) level. Position-dependent source context features produce significant improvements in terms of all evaluation metrics.
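
As a rough illustration of what "position-dependent source context features" could look like at the character level, the following Python sketch collects the neighbours of a source unit together with their relative positions. It is not taken from the paper: the function name `context_features`, the window size, and the `<pad>` boundary symbol are all assumptions made for this example, and the paper's actual feature set and classifier setup may differ.

```python
from typing import List, Tuple


def context_features(units: List[str], index: int, window: int = 2) -> List[Tuple[int, str]]:
    """Return (relative_position, neighbour) pairs around units[index].

    Hypothetical helper: each neighbour is tagged with its offset from the
    focus unit, so the same character at position -1 and at position +1
    yields two different features.
    """
    features = []
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # skip the focus unit itself
        pos = index + offset
        # Use an assumed padding symbol outside the word boundaries.
        neighbour = units[pos] if 0 <= pos < len(units) else "<pad>"
        features.append((offset, neighbour))
    return features


if __name__ == "__main__":
    # Character-level units of an example source name.
    name = list("delhi")
    print(context_features(name, 2))
```

The same function could be applied to a list of transliteration units instead of single characters. In a context-informed setup, feature vectors of this kind would be passed to a classifier that estimates how likely each target unit is given the source unit and its context.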

Cite

Haque, R., Dandapat, S., Srivastava, A. K., Naskar, S. K., & Way, A. (2009). English-Hindi transliteration using context-informed PB-SMT: The DCU system for NEWS 2009. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 104–107). Association for Computational Linguistics (ACL).
