NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Yu Chun Wang; Chun Kai Wu; Richard Tzong Han Tsai

Conference Proceedings

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2015) 2015-July 83-87

DOI: 10.18653/v1/w15-3913

4Citations

70Readers

Get full text

Abstract

This paper describes our approach to English-Korean and English-Chinese transliteration task of NEWS 2015. We use different grapheme segmentation approaches on source and target languages to train several transliteration models based on the M2M-aligner and DirecTL+, a string transduction model. Then, we use two reranking techniques based on string similarity and web co-occurrence to select the best transliteration among the prediction results from the different models. Our English-Korean standard and non-standard runs achieve 0.4482 and 0.5067 in top-1 accuracy respectively, and our English-Chinese standard runs achieves 0.2925 in top-1 accuracy. c 2015 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Wang, Y. C., Wu, C. K., & Tsai, R. T. H. (2015). NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2015-July, pp. 83–87). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-3913

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Abstract

Cite

Register to see more suggestions