The detection and correction of erroneous Chinese characters is an important problem in many applications. This paper proposes an automatic method for correcting erroneous Chinese characters. The method is divided into two parts, each handling one type of error: an erroneous character occurring in a word of length one, and an erroneous character occurring in a word of length two or more. The first part primarily uses a rule-based method, while the second combines similarity and syntactic-rationality parameters in a linear regression model to predict erroneous characters. Experimental results show that the proposed method achieves an F1 score of 0.34 and a false positive rate (FPR) of 0.18.
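As a rough illustration of the second part and of the two reported metrics, the sketch below scores each item with a linear combination of a similarity feature and a syntax feature, flags items above a threshold, and then computes F1 and FPR. The feature values, weights, bias, threshold, and function names are all hypothetical and chosen only for this example; the abstract does not specify the features or the fitted coefficients, so this should not be read as the authors' implementation.

```python
# Hypothetical sketch: linear scoring of candidates, then F1 and FPR.
# All weights, thresholds, and feature values are illustrative assumptions.

def linear_score(similarity, syntax, w_sim=0.6, w_syn=0.4, bias=0.0):
    """Combine two features linearly (weights are assumed, not from the paper)."""
    return w_sim * similarity + w_syn * syntax + bias


def flag_errors(features, threshold=0.5):
    """Flag an item as erroneous when its linear score exceeds the threshold."""
    return [linear_score(sim, syn) > threshold for sim, syn in features]


def f1_and_fpr(predicted, actual):
    """Return (F1, FPR) for parallel lists of booleans (True = erroneous)."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    tn = sum(1 for p, a in pairs if not p and not a)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return f1, fpr


# Toy data: (similarity, syntax) pairs and made-up gold labels.
toy_features = [(0.9, 0.8), (0.7, 0.4), (0.2, 0.3), (0.1, 0.6), (0.8, 0.9)]
toy_gold = [True, False, False, False, True]

toy_predicted = flag_errors(toy_features)
toy_f1, toy_fpr = f1_and_fpr(toy_predicted, toy_gold)
```

Here F1 is the harmonic mean of precision and recall, and FPR is the share of correct items that were wrongly flagged, so a lower FPR means fewer false alarms.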
CITATION STYLE
Chang, T. H., Yang, C. H., & Chen, H. C. (2015). Introduction to a proofreading tool for Chinese spelling check task of SIGHAN-8. In Proceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015 (pp. 50–55). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-3109