Automatic expansion of abbreviations in Chinese news text

Guohong Fu; Kang Kwong Luke; Guo Dong Zhou; Ruifeng Xu

Conference Proceedings

Automatic expansion of abbreviations in Chinese news text

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4182 LNCS 530-536

DOI: 10.1007/11880592_42

8Citations

1Readers

Get full text

Abstract

This paper presents an n-gram based approach to Chinese abbreviation expansion. In this study, we distinguish reduced abbreviations from non-reduced abbreviations that are created by elimination or generalization. For a reduced abbreviation, a mapping table is compiled to map each short-word in it to a set of long-words, and a bigram based Viterbi algorithm is thus applied to decode an appropriate combination of long-words as its full-form. For a non-reduced abbreviation, a dictionary of non-reduced abbreviation/full-form pairs is used to generate its expansion candidates, and a disambiguation technique is further employed to select a proper expansion based on bigram word segmentation. The evaluation on an abbreviation-expanded corpus built from the PKU corpus showed that the proposed system achieved a recall of 82.9% and a precision of 85.5% on average for different types of abbreviations in Chinese news text. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Fu, G., Luke, K. K., Zhou, G. D., & Xu, R. (2006). Automatic expansion of abbreviations in Chinese news text. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4182 LNCS, pp. 530–536). Springer Verlag. https://doi.org/10.1007/11880592_42

Automatic expansion of abbreviations in Chinese news text

Abstract

Cite

Register to see more suggestions