Improving Neural Chinese Word Segmentation with Lexicon-enhanced Adaptive Attention

Xiaoyan Zhao; Min Yang; Qiang Qu; Yang Sun

Conference Proceedings · Open Access

SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020) 1953-1956

DOI: 10.1145/3397271.3401328

Abstract

Chinese word segmentation (CWS) is an important research topic in information retrieval (IR) and natural language processing (NLP). Deep neural networks that use context features have made significant progress on this task. However, these deep models may fail on rare or ambiguous words, which limits overall CWS performance. In this paper, we propose a lexicon-enhanced adaptive attention network (LAAN), which uses external lexicons to handle rare or ambiguous words. Specifically, we devise an adaptive attention mechanism to learn a lexicon-aware representation. In addition, we propose a fusion gate that integrates the additional word information with context information to improve CWS performance. LAAN is evaluated on four benchmark datasets, and the experimental results show that it consistently outperforms the compared methods.
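
The abstract does not give LAAN's equations, so the following is only a generic sketch of the two ideas it names: attention over candidate lexicon word vectors, and a gate that mixes the result with a context vector. Plain dot-product attention stands in for the paper's adaptive attention, and every name here (`attend`, `fusion_gate`, `gate_weights`, `gate_bias`) is hypothetical rather than taken from the authors' code.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    # Subtract the maximum for numerical stability.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(context, word_vectors):
    """Weighted sum of lexicon word vectors, scored against the context.

    Assumption: dot-product scoring; the paper's adaptive attention may differ.
    """
    weights = softmax([dot(context, w) for w in word_vectors])
    return [
        sum(wt * w[i] for wt, w in zip(weights, word_vectors))
        for i in range(len(context))
    ]


def fusion_gate(context, lexicon, gate_weights, gate_bias):
    """Element-wise gate: g = sigmoid(W [context; lexicon] + b).

    Returns g * context + (1 - g) * lexicon (an assumed, common gating form).
    """
    joined = context + lexicon  # list concatenation
    gate = [sigmoid(dot(row, joined) + b) for row, b in zip(gate_weights, gate_bias)]
    return [g * c + (1.0 - g) * l for g, c, l in zip(gate, context, lexicon)]


# Toy 2-dimensional example with two candidate lexicon words.
context = [1.0, 0.0]
word_vectors = [[1.0, 0.0], [0.0, 1.0]]
lexicon = attend(context, word_vectors)

# Zero weights and bias give a gate of 0.5 everywhere (an even mix).
gate_weights = [[0.0] * 4, [0.0] * 4]
gate_bias = [0.0, 0.0]
fused = fusion_gate(context, lexicon, gate_weights, gate_bias)
```

In a trained model the gate parameters would be learned, letting each dimension lean on the lexicon when the context alone is unreliable, for example on rare or ambiguous words.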

Citation (APA)

Zhao, X., Yang, M., Qu, Q., & Sun, Y. (2020). Improving Neural Chinese Word Segmentation with Lexicon-enhanced Adaptive Attention. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1953–1956). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401328
