Chinese word segmentation is an important and necessary problem to analyze Chinese texts. In this paper, we focus on the primary challenges in Chinese word segmentation: low accuracy of out-of-vocabulary word. To resolve this difficult problems, we group the "similar" characters to generate more abstract representation. Experimental results show that character abstraction yields a significant relative error reduction of 24.83% in average over the state-of-the-art baseline. © Springer-Verlag 2013.
CITATION STYLE
Tian, L., Qiu, X., & Huang, X. (2013). Chinese word segmentation with character abstraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8202 LNAI, pp. 36–43). https://doi.org/10.1007/978-3-642-41491-6_4
Mendeley helps you to discover research relevant for your work.