Combining Contextual Information by Self-attention Mechanism in Convolutional Neural Networks for Text Classification

Abstract

Convolutional neural networks (CNNs) are widely used in many NLP tasks, where convolutional filters capture useful semantic features of texts. However, filters with a small window size may lose the global context of a text, while simply enlarging the window size brings problems of data sparsity and an enormous number of parameters. To capture global context information, we propose using the self-attention mechanism to obtain contextual word embeddings. We present two methods for combining word embeddings and contextual embeddings, and then apply convolutional neural networks to capture semantic features. Experimental results on five commonly used datasets show the effectiveness of the proposed methods.
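
The pipeline the abstract describes (self-attention to obtain contextual embeddings, a combination step, then a CNN over the combined representation) can be sketched as follows. This is a minimal illustration in PyTorch; the scaled dot-product self-attention, the concatenation-based combination, and all hyperparameters are assumptions chosen for clarity, not the authors' exact formulation.

# Minimal sketch (PyTorch assumed): self-attention builds contextual
# embeddings, which are concatenated with the original word embeddings
# (one plausible combination method) before a convolutional layer
# extracts n-gram features. Names and hyperparameters are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentiveCNNClassifier(nn.Module):
    def __init__(self, vocab_size, embed_dim=128, num_filters=100,
                 window_size=3, num_classes=5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.scale = embed_dim ** 0.5  # scaling for dot-product attention
        # Convolution over the concatenation of word + contextual embeddings,
        # so in_channels is twice the embedding dimension.
        self.conv = nn.Conv1d(2 * embed_dim, num_filters, window_size,
                              padding=window_size // 2)
        self.fc = nn.Linear(num_filters, num_classes)

    def forward(self, token_ids):                        # (batch, seq_len)
        x = self.embed(token_ids)                        # (batch, seq, d)
        # Self-attention over the whole sentence gives each position a
        # globally informed (contextual) representation.
        scores = x @ x.transpose(1, 2) / self.scale      # (batch, seq, seq)
        context = F.softmax(scores, dim=-1) @ x          # (batch, seq, d)
        combined = torch.cat([x, context], dim=-1)       # (batch, seq, 2d)
        feats = F.relu(self.conv(combined.transpose(1, 2)))  # (batch, f, seq)
        pooled = feats.max(dim=-1).values                # max-over-time pooling
        return self.fc(pooled)                           # class logits

# Usage:
# model = AttentiveCNNClassifier(vocab_size=10000)
# logits = model(torch.randint(0, 10000, (4, 20)))  # batch of 4 sentences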

Citation (APA)

Wu, X., Cai, Y., Li, Q., Xu, J., & Leung, H.-f. (2018). Combining Contextual Information by Self-attention Mechanism in Convolutional Neural Networks for Text Classification. In Lecture Notes in Computer Science (Vol. 11233 LNCS, pp. 453–467). Springer. https://doi.org/10.1007/978-3-030-02922-7_31
