Detection of Chinese word usage errors for non-Native Chinese learners with bidirectional LSTM

12Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.

Abstract

Selecting appropriate words to compose a sentence is one common problem faced by non-native Chinese learners. In this paper, we propose (bidirectional) LSTM sequence labeling models and explore various features to detect word usage errors in Chinese sentences. By combining CWINDOW word embedding features and POS information, the best bidirectional LSTM model achieves accuracy 0.5138 and MRR 0.6789 on the HSK dataset. For 80.79% of the test data, the model ranks the ground-truth within the top two at position level.

Cite

CITATION STYLE

APA

Shiue, Y. T., Huang, H. H., & Chen, H. H. (2017). Detection of Chinese word usage errors for non-Native Chinese learners with bidirectional LSTM. In ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (Vol. 2, pp. 404–410). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/P17-2064

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free