ACT: An Attentive Convolutional Transformer for Efficient Text Classification

Abstract

Recently, the Transformer has demonstrated promising performance in many NLP tasks and is increasingly replacing the Recurrent Neural Network (RNN). Meanwhile, the Convolutional Neural Network (CNN) has received less attention because it is weak at capturing sequential and long-distance dependencies, although it has excellent local feature extraction capability. In this paper, we introduce an Attentive Convolutional Transformer (ACT) that combines the advantages of both the Transformer and the CNN for efficient text classification. Specifically, we propose a novel attentive convolution mechanism that attends over the semantic meaning of convolutional filters to transform text from a complex word space to a more informative convolutional filter space in which important n-grams are captured. ACT is able to capture both local and global dependencies effectively while preserving sequential information. Experiments on various text classification tasks and detailed analyses show that ACT is a lightweight, fast, and effective universal text classifier, outperforming CNNs, RNNs, and attentive models including the Transformer.
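
The abstract gives no formulas, so the following is only a rough illustration of one way "attending over convolutional filters" could be read, not the authors' implementation. It assumes that each n-gram window is scored against every filter by a dot product, that the scores are normalized with a softmax, and that the window is then re-expressed as the weighted sum of the filter vectors. The function names, toy embeddings, and filter values are all hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attentive_convolution(embeddings, filters, width):
    """Map each n-gram window to a softmax-weighted mix of filter vectors.

    Assumption (not from the paper): the attention weights are the softmax
    of the ordinary convolution responses of the window to each filter.
    """
    outputs = []
    for start in range(len(embeddings) - width + 1):
        # Flatten the window of `width` word vectors into one n-gram vector.
        window = [x for vec in embeddings[start:start + width] for x in vec]
        # Convolution response: dot product of the window with each filter.
        scores = [sum(w * f for w, f in zip(window, filt)) for filt in filters]
        weights = softmax(scores)
        # Re-express the window in "filter space" as a weighted sum of filters.
        mixed = [sum(a * filt[i] for a, filt in zip(weights, filters))
                 for i in range(len(window))]
        outputs.append(mixed)
    return outputs

# Toy input: 3 word vectors of dimension 2, and 2 hypothetical bigram filters.
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
filters = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]
result = attentive_convolution(embeddings, filters, width=2)
```

Under these assumptions, each output position stays aligned with its n-gram, so word order is preserved, while the representation is pulled toward whichever filters the window matches most strongly.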

Cite

Li, P., Zhong, P., Mao, K., Wang, D., Yang, X., Liu, Y., … See, S. (2021). ACT: An Attentive Convolutional Transformer for Efficient Text Classification. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 15, pp. 13261–13269). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i15.17566
