Chinese Text Classification Method Based on BERT Word Embedding


Abstract

In this paper, we enhance the semantic representation of words with the BERT pre-trained language model, which dynamically generates a semantic vector for each character according to its context; the resulting character vectors are then fed, as a character-level embedding sequence, into a capsule network (CapsNet). We build a BiGRU module inside the capsule network for text feature extraction and introduce an attention mechanism to focus on key information. For the experiments, we use Baidu's Chinese question-answering corpus, taking only the question types as classification labels. As baselines, we evaluate a standalone BERT network and a standalone CapsNet. The experimental results show that the combined model outperforms either model used alone.
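As a rough illustration of the pipeline the abstract describes, the sketch below wires BERT character-level embeddings into a BiGRU with additive attention and a simplified class-capsule output head. It is a minimal sketch assuming PyTorch and Hugging Face Transformers; the layer sizes, the attention form, and the omission of dynamic routing between capsule layers are illustrative assumptions, not details from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import BertModel, BertTokenizer

def squash(x, dim=-1, eps=1e-8):
    # Standard capsule squashing non-linearity: shrinks short vectors
    # toward zero and long vectors toward unit length.
    norm_sq = (x ** 2).sum(dim=dim, keepdim=True)
    return (norm_sq / (1.0 + norm_sq)) * x / torch.sqrt(norm_sq + eps)

class BertBiGRUCapsNet(nn.Module):
    # Hypothetical module name; sizes below are illustrative assumptions.
    def __init__(self, num_classes, gru_hidden=128, caps_dim=16):
        super().__init__()
        # bert-base-chinese tokenizes per character, so BERT's outputs act
        # as context-dependent character-level embeddings.
        self.bert = BertModel.from_pretrained("bert-base-chinese")
        self.bigru = nn.GRU(self.bert.config.hidden_size, gru_hidden,
                            batch_first=True, bidirectional=True)
        # Additive attention over BiGRU states to focus on key tokens.
        self.attn = nn.Linear(2 * gru_hidden, 1)
        # One output capsule per class; the longest capsule vector wins.
        # (Routing-by-agreement is omitted for brevity.)
        self.class_caps = nn.Linear(2 * gru_hidden, num_classes * caps_dim)
        self.num_classes, self.caps_dim = num_classes, caps_dim

    def forward(self, input_ids, attention_mask):
        # Dynamic (contextual) vectors from BERT, one per character.
        h = self.bert(input_ids=input_ids,
                      attention_mask=attention_mask).last_hidden_state
        g, _ = self.bigru(h)                              # (B, T, 2*gru_hidden)
        scores = self.attn(g).squeeze(-1)                 # (B, T)
        scores = scores.masked_fill(attention_mask == 0, -1e9)
        alpha = F.softmax(scores, dim=-1).unsqueeze(-1)   # attention weights
        ctx = (alpha * g).sum(dim=1)                      # attended summary
        caps = self.class_caps(ctx).view(-1, self.num_classes, self.caps_dim)
        return squash(caps).norm(dim=-1)                  # per-class capsule lengths

tokenizer = BertTokenizer.from_pretrained("bert-base-chinese")
model = BertBiGRUCapsNet(num_classes=6)
batch = tokenizer(["今天天气怎么样？"], return_tensors="pt", padding=True)
lengths = model(batch["input_ids"], batch["attention_mask"])  # (1, 6)
```

In a full CapsNet, the class capsules would be computed from lower-level capsules via dynamic routing; here a single linear projection stands in for that step to keep the sketch short.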

Citation (APA)
Wang, Z., Huang, Z., & Gao, J. (2020). Chinese Text Classification Method Based on BERT Word Embedding. In ACM International Conference Proceeding Series (pp. 66–71). Association for Computing Machinery. https://doi.org/10.1145/3395260.3395273
