A New Method of Improving BERT for Text Classification

Shaomin Zheng; Meng Yang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11936 LNCS 442-452

DOI: 10.1007/978-3-030-36204-1_37

32 Citations

24 Readers

Abstract

Text classification is a basic task in natural language processing. Recently, pre-trained models such as BERT have achieved outstanding results compared with previous methods. However, BERT fails to take into account local information in the text, such as sentences and phrases. In this paper, we present a BERT-CNN model for text classification. By adding a CNN to the task-specific layers of the BERT model, our model can capture the information of important fragments in the text. In addition, we feed the local representation, together with the output of BERT, into a transformer encoder to take advantage of the self-attention mechanism, and finally obtain the representation of the whole text from the transformer layer. Extensive experiments demonstrate that our model obtains competitive performance against state-of-the-art baselines on four benchmark datasets.
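
The data flow described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation: the random vectors stand in for BERT token outputs, and the function names, the single filter width, the max-pooling step, the identity attention projections, and the use of position 0 as the text representation are all assumptions made for illustration.

```python
import math
import random


def conv1d_max_pool(tokens, filters, width):
    """Slide each filter over windows of `width` tokens, apply ReLU,
    and max-pool over positions to get one local feature per filter."""
    pooled = []
    for f in filters:  # f is a flat weight list of length width * dim
        best = 0.0  # starting at 0 makes the max act as ReLU followed by pooling
        for start in range(len(tokens) - width + 1):
            window = [x for tok in tokens[start:start + width] for x in tok]
            best = max(best, sum(w * x for w, x in zip(f, window)))
        pooled.append(best)
    return pooled


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(seq):
    """Single-head scaled dot-product self-attention.
    Assumption: identity Q/K/V projections, no feed-forward or layer norm."""
    dim = len(seq[0])
    out = []
    for q in seq:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in seq]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, seq)) for i in range(dim)])
    return out


def bert_cnn_representation(bert_tokens, filters, width):
    """Hypothetical fusion step: the CNN's local vector is appended to the
    BERT output sequence, and self-attention mixes the two."""
    local = conv1d_max_pool(bert_tokens, filters, width)
    fused = self_attention(bert_tokens + [local])
    return fused[0]  # assumption: position 0 plays the role of [CLS]


random.seed(0)
dim, seq_len, width = 4, 6, 3
# Stand-in for BERT outputs: seq_len token vectors of size dim.
bert_tokens = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(seq_len)]
# One filter per hidden dimension so the local vector matches the token size.
filters = [[random.uniform(-1, 1) for _ in range(width * dim)] for _ in range(dim)]
text_vec = bert_cnn_representation(bert_tokens, filters, width)
print(len(text_vec))
```

In a real system the stand-in vectors would come from a pre-trained BERT encoder, and the resulting text vector would be passed to a classification layer; both are omitted here to keep the sketch self-contained.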

Cite

Zheng, S., & Yang, M. (2019). A New Method of Improving BERT for Text Classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11936 LNCS, pp. 442–452). Springer. https://doi.org/10.1007/978-3-030-36204-1_37