Combining a two-step conditional random field model and a joint source channel model for machine transliteration


Abstract

This paper describes our system for the NEWS 2009 Machine Transliteration Shared Task (NEWS 2009). We participated only in the standard run, which is a direct orthographical mapping (DOM) between two languages without any intermediate phonemic mapping. We propose a new two-step conditional random field (CRF) model for DOM machine transliteration, in which the first CRF segments a source word into chunks and the second CRF maps the chunks to a word in the target language. The two-step CRF model obtains a slightly lower top-1 accuracy than a state-of-the-art n-gram joint source-channel model. Combining the CRF model with the joint source-channel model leads to improvements in all the tasks. The official results of our system in the NEWS 2009 shared task confirm its effectiveness: we achieved a top-1 accuracy of 0.627 for Japanese transliterated to Japanese Kanji (JJ), 0.713 for English-to-Chinese (E2C) and 0.510 for English-to-Japanese Katakana (E2J).
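The two-step idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The B/I tag scheme, the function names, and the lookup table `TOY_TABLE` are assumptions made for illustration. In the real system both steps would be trained CRFs. Here, step 1 is represented only by its output (one tag per character) and step 2 by a plain dictionary lookup. The accuracy function assumes that a prediction counts as correct when it matches any reference transliteration.

```python
from typing import Dict, List, Set

# Hypothetical chunk-to-Katakana table standing in for the second CRF.
TOY_TABLE: Dict[str, str] = {"to": "ト", "ma": "マ"}


def tags_to_chunks(word: str, tags: List[str]) -> List[str]:
    """Convert per-character B/I tags (assumed step 1 output) into chunks.

    "B" starts a new chunk; any other tag extends the current chunk.
    """
    if len(word) != len(tags):
        raise ValueError("need exactly one tag per character")
    chunks: List[str] = []
    for ch, tag in zip(word, tags):
        if tag == "B" or not chunks:
            chunks.append(ch)
        else:
            chunks[-1] += ch
    return chunks


def map_chunks(chunks: List[str], table: Dict[str, str]) -> str:
    """Stand-in for step 2: map each source chunk to a target string.

    Unknown chunks become "?" so that failures stay visible.
    """
    return "".join(table.get(chunk, "?") for chunk in chunks)


def top1_accuracy(predictions: List[str], references: List[Set[str]]) -> float:
    """Fraction of words whose top-1 output matches any reference."""
    if not predictions or len(predictions) != len(references):
        raise ValueError("predictions and references must align and be non-empty")
    correct = sum(1 for p, refs in zip(predictions, references) if p in refs)
    return correct / len(predictions)


if __name__ == "__main__":
    chunks = tags_to_chunks("tomato", list("BIBIBI"))
    print(chunks)                         # ['to', 'ma', 'to']
    print(map_chunks(chunks, TOY_TABLE))  # トマト
```

A dictionary lookup ignores context, which is where a sequence model such as a CRF would be expected to help: the same source chunk can map to different target strings depending on its neighbours.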


Cite

Yang, D., Dixon, P., Pan, Y. C., Oonishi, T., Nakamura, M., & Furui, S. (2009). Combining a two-step conditional random field model and a joint source channel model for machine transliteration. In NEWS 2009 - 2009 Named Entities Workshop: Shared Task on Transliteration at the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 (pp. 72–75). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1699705.1699724
