Combining a two-step conditional random field model and a joint source channel model for machine transliteration

Dong Yang; Paul Dixon; Yi Cheng Pan; Tasuku Oonishi; Masanobu Nakamura; Sadaoki Furui

Conference ProceedingsOPEN ACCESS

Combining a two-step conditional random field model and a joint source channel model for machine transliteration

NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (2009) 72-75

DOI: 10.3115/1699705.1699724

14Citations

69Readers

Abstract

This paper describes our system for “NEWS 2009 Machine Transliteration Shared Task” (NEWS 2009). We only participated in the standard run, which is a direct orthographical mapping (DOP) between two languages without using any intermediate phonemic mapping. We propose a new two-step conditional random field (CRF) model for DOP machine transliteration, in which the first CRF segments a source word into chunks and the second CRF maps the chunks to a word in the target language. The two-step CRF model obtains a slightly lower top-1 accuracy when compared to a state-of-the-art n-gram joint source-channel model. The combination of the CRF model with the joint source-channel leads to improvements in all the tasks. The official result of our system in the NEWS 2009 shared task confirms the effectiveness of our system; where we achieved 0.627 top-1 accuracy for Japanese transliterated to Japanese Kanji(JJ), 0.713 for English-to-Chinese(E2C) and 0.510 for English-to-Japanese Katakana(E2J).

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Yang, D., Dixon, P., Pan, Y. C., Oonishi, T., Nakamura, M., & Furui, S. (2009). Combining a two-step conditional random field model and a joint source channel model for machine transliteration. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 72–75). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1699705.1699724

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 23

68%

Researcher 7

21%

Professor / Associate Prof. 2

Lecturer / Post doc 2

Readers' Discipline

Computer Science 26

72%

Linguistics 6

17%

Engineering 3

Neuroscience 1

Combining a two-step conditional random field model and a joint source channel model for machine transliteration

Abstract

References Powered by Scopus

Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm

Iterative language model estimation: Efficient data structure & algorithms

Grapheme-to-phone using finite-state transducers

Cited by Powered by Scopus

A linear-time bottom-up discourse parser with constraints and post-editing

Named entity transliteration with sequence-to-sequence neural network

Improving low-resource machine transliteration by using 3-way transfer learning

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline