Natural language word prediction model based on multi-window convolution and residual network

Abstract

In this paper, we propose the MCNN-ReMGU model, based on multi-window convolution and a residual-connected minimal gated unit (MGU) network, for natural language word prediction. First, convolution kernels of different sizes extract local feature information at different granularities from the word sequences. The extracted features are then fed into the residual-connected MGU network, and the prediction results are output by a softmax layer. The residual connections in the MGU network not only mitigate vanishing gradients and network degradation, but also capture long-term dependencies between word sequences, allowing the next word to be predicted accurately. Meanwhile, the multiple convolution kernels of the convolutional neural network (CNN) enable feature information between word sequences to be extracted more fully. Experimental results on the Penn Treebank and WikiText-2 datasets show that the proposed method performs favorably on the word prediction task.
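The pipeline the abstract describes (multi-window convolution → residual MGU layers → softmax) can be sketched roughly as follows. This is a toy NumPy illustration with randomly initialized, untrained weights; all layer sizes, window widths, and helper names (`conv1d_same`, `mgu_layer`) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def conv1d_same(x, kernel_size, out_dim, rng):
    """Multi-window branch: 1D convolution over the time axis with 'same'
    padding, so each window size yields a (T, out_dim) feature map."""
    T, d = x.shape
    W = rng.standard_normal((kernel_size * d, out_dim)) * 0.1
    pad = kernel_size // 2
    xp = np.pad(x, ((pad, kernel_size - 1 - pad), (0, 0)))
    out = np.stack([xp[t:t + kernel_size].reshape(-1) @ W for t in range(T)])
    return np.maximum(out, 0.0)  # ReLU

def mgu_layer(x, hidden, rng, residual=True):
    """Minimal gated unit: a single forget gate f controls both the
    candidate state and the state update. A residual (skip) connection
    adds the layer input to its output when dimensions match."""
    T, d = x.shape
    Wf = rng.standard_normal((d + hidden, hidden)) * 0.1
    Wh = rng.standard_normal((d + hidden, hidden)) * 0.1
    h = np.zeros(hidden)
    hs = []
    for t in range(T):
        f = 1.0 / (1.0 + np.exp(-np.concatenate([h, x[t]]) @ Wf))  # forget gate
        h_tilde = np.tanh(np.concatenate([f * h, x[t]]) @ Wh)      # candidate
        h = (1.0 - f) * h + f * h_tilde                            # state update
        hs.append(h)
    out = np.stack(hs)
    return out + x if (residual and d == hidden) else out

# Toy forward pass: embed a word sequence, run three window sizes in
# parallel, concatenate, pass through stacked residual MGU layers,
# then map the final state to a next-word distribution via softmax.
vocab, embed_dim, T = 50, 16, 8          # assumed toy sizes
emb = rng.standard_normal((vocab, embed_dim)) * 0.1
tokens = rng.integers(0, vocab, T)
x = emb[tokens]                                    # (T, embed_dim)
branches = [conv1d_same(x, k, 32, rng) for k in (2, 3, 4)]
feats = np.concatenate(branches, axis=-1)          # (T, 96)
h = mgu_layer(mgu_layer(feats, 96, rng), 96, rng)  # stacked residual MGUs
logits = h[-1] @ (rng.standard_normal((96, vocab)) * 0.1)
probs = softmax(logits)                            # next-word distribution
```

With trained weights, `probs` would rank candidate next words; here it only demonstrates that the shapes and gating arithmetic compose as described.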

Citation (APA)

Yang, J., Wang, H., & Guo, K. (2020). Natural language word prediction model based on multi-window convolution and residual network. IEEE Access, 8, 188036–188043. https://doi.org/10.1109/ACCESS.2020.3031200
