Improving Neural Chinese Word Segmentation with Lexicon-enhanced Adaptive Attention

7Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Chinese word segmentation (CWS) is an important research topic in information retrieval (IR) and natural language processing (NLP). Significant progresses have been made by deep neural networks with context features. However, these deep models may fail to deal with rare or ambiguous words, thus limit the overall CWS performance. In this paper, we propose a lexicon-enhanced adaptive attention network (LAAN), which takes full advantage of external lexicons to deal with the rare or ambiguous words. Specifically, we devise an adaptive attention mechanism to learn the lexicon-aware representation. In addition, we propose a fusion gate to effectively integrate the additional word information with context information to improve the performance of CWS. LAAN is evaluated on four benchmark datasets, and the experimental results demonstrate that LAAN has robust superiority over the compared methods.

Cite

CITATION STYLE

APA

Zhao, X., Yang, M., Qu, Q., & Sun, Y. (2020). Improving Neural Chinese Word Segmentation with Lexicon-enhanced Adaptive Attention. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1953–1956). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401328

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free