Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

65Citations
Citations of this article
88Readers
Mendeley users who have this article in their library.

Abstract

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for user-generated text in Chinese language. Most of the Chinese spelling errors are misused semantically, phonetically or graphically similar characters. Previous attempts notice this phenomenon and try to utilize the similarity relationship for this task. However, these methods use either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called REALISE, by directly leveraging the multimodal information of the Chinese characters. The REALISE model tackles the CSC task by (1) capturing the semantic, phonetic and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a large margin.

Cite

CITATION STYLE

APA

Xu, H. D., Li, Z., Zhou, Q., Li, C., Wang, Z., Cao, Y., … Mao, X. L. (2021). Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 716–728). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.64

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free