Think twice: A post-processing approach for the Chinese spelling error correction

Wei Gou; Zheng Chen

Journal Article (Open Access)

Applied Sciences (Switzerland) (2021) 11(13)

DOI: 10.3390/app11135832

Abstract

Chinese Spelling Error Correction is an active research topic in natural language processing. Researchers have proposed many solutions, from early rule-based methods to current deep learning methods. At present, SpellGCN, proposed by Alibaba’s team, achieves the best results, with a character-level precision of 98.4% on SIGHAN2013. However, when we apply this algorithm to practical error correction tasks, it produces many false corrections. We believe that this is because the corpus used for model training contains significantly more errors than the text the model is asked to correct in practice. In response to this problem, we propose adding a post-processing step to the error correction task. We take the initial model’s output as a candidate character, extract various features of the character itself and its context, and then use a classification model to filter out the initial model’s false corrections. The post-processing idea introduced in this paper can be applied to most Chinese Spelling Error Correction models to improve their performance on practical error correction tasks.
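The filtering step described in the abstract can be pictured with a minimal sketch. Everything named here is an assumption made for illustration: the `Proposal` record, the `extract_features` helper, the feature set, and the `accept` callback standing in for the classification model are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Proposal:
    """One correction suggested by the initial model (hypothetical record)."""
    position: int      # index of the character in the sentence
    original: str      # character found in the input
    candidate: str     # character proposed by the initial model
    confidence: float  # assumed: the initial model's score for the candidate


def extract_features(sentence: str, p: Proposal) -> Dict[str, object]:
    """Collect simple features of the candidate and its context.

    The feature set is illustrative only; the abstract does not list
    the features the authors use.
    """
    left = sentence[p.position - 1] if p.position > 0 else ""
    right = sentence[p.position + 1] if p.position + 1 < len(sentence) else ""
    return {
        "confidence": p.confidence,
        "original": p.original,
        "candidate": p.candidate,
        "left": left,
        "right": right,
    }


def post_process(
    sentence: str,
    proposals: List[Proposal],
    accept: Callable[[Dict[str, object]], bool],
) -> str:
    """Apply only the proposals that the classifier accepts."""
    chars = list(sentence)
    for p in proposals:
        if p.candidate == p.original:
            continue  # the initial model proposed no change here
        if accept(extract_features(sentence, p)):
            chars[p.position] = p.candidate
    return "".join(chars)


def threshold_classifier(features: Dict[str, object]) -> bool:
    """Toy stand-in for a trained classifier: keep confident proposals."""
    return float(features["confidence"]) >= 0.5
```

In this sketch the classifier is passed in as a plain callable, so a trained model could replace `threshold_classifier` without changing `post_process`. The initial model's proposals are never applied directly; each one has to pass the second check first.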

Citation (APA)

Gou, W., & Chen, Z. (2021). Think twice: A post-processing approach for the Chinese spelling error correction. Applied Sciences (Switzerland), 11(13). https://doi.org/10.3390/app11135832
