Neural networks incorporating dictionaries for Chinese word segmentation

54Citations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

In recent years, deep neural networks have achieved significant success in Chinese word segmentation and many other natural language processing tasks. Most of these algorithms are end-to-end trainable systems and can effectively process and learn from large scale labeled datasets. However, these methods typically lack the capability of processing rare words and data whose domains are different from training data. Previous statistical methods have demonstrated that human knowledge can provide valuable information for handling rare cases and domain shifting problems. In this paper, we seek to address the problem of incorporating dictionaries into neural networks for the Chinese word segmentation task. Two different methods that extend the bi-directional long short-term memory neural network are proposed to perform the task. To evaluate the performance of the proposed methods, state-of-the-art supervised models based methods and domain adaptation approaches are compared with our methods on nine datasets from different domains. The experimental results demonstrate that the proposed methods can achieve better performance than other state-of-the-art neural network methods and domain adaptation approaches in most cases.

Cite

CITATION STYLE

APA

Zhang, Q., Liu, X., & Fu, J. (2018). Neural networks incorporating dictionaries for Chinese word segmentation. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 5682–5689). AAAI press. https://doi.org/10.1609/aaai.v32i1.11959

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free