Neural word segmentation learning for Chinese

106Citations
Citations of this article
201Readers
Mendeley users who have this article in their library.

Abstract

Most previous approaches to Chinese word segmentation formalize this problem as a character-based sequence labeling task so that only contextual information within fixed sized local windows and simple interactions between adjacent tags can be captured. In this paper, we propose a novel neural framework which thoroughly eliminates context windows and can utilize complete segmentation history. Our model employs a gated combination neural network over characters to produce distributed representations of word candidates, which are then given to a long shortterm memory (LSTM) language scoring model. Experiments on the benchmark datasets show that without the help of feature engineering as most existing approaches, our models achieve competitive or better performances with previous stateof- the-art methods.

References Powered by Scopus

Long Short-Term Memory

78352Citations
N/AReaders
Get full text

Learning phrase representations using RNN encoder-decoder for statistical machine translation

11809Citations
N/AReaders
Get full text

A Neural Probabilistic Language Model

5182Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Transition-based neural word segmentation

103Citations
N/AReaders
Get full text

Neural word segmentation with rich pretraining

87Citations
N/AReaders
Get full text

A stacking gated neural architecture for implicit discourse relation classification

69Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Cai, D., & Zhao, H. (2016). Neural word segmentation learning for Chinese. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 1, pp. 409–420). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1039

Readers over time

‘16‘17‘18‘19‘20‘21‘22‘23‘24‘2508162432

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 83

75%

Researcher 17

15%

Professor / Associate Prof. 6

5%

Lecturer / Post doc 4

4%

Readers' Discipline

Tooltip

Computer Science 110

86%

Linguistics 10

8%

Engineering 4

3%

Agricultural and Biological Sciences 4

3%

Save time finding and organizing research with Mendeley

Sign up for free
0