CANCN-BERT: A Joint Pre-Trained Language Model for Classical and Modern Chinese

3Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Pre-Trained Models (PTMs) can learn general knowledge representations and perform well in Natural Language Processing (NLP) tasks. For the Chinese language, several PTMs are developed, however, most existing methods concentrate on modern Chinese and are not ideal for processing classical Chinese due to the differences in grammars and semantics between these two forms. In this paper, in order to process two forms of Chinese uniformly, we propose a novel Classical and Modern Chinese pre-trained language model (CANCN-BERT), with the advantage of effectively processing both classical and modern Chinese, which is an extension of BERT. Form-aware pre-training tasks are elaborately designed to train our model, so as to better adapt it to classical and modern Chinese corpus. Moreover, we define a joint model, proposing dedicated optimization methods through different paths with the control of the switch mechanism. Our model merges characteristics of both classical and modern Chinese, which can adequately and efficiently enhance the representation ability for both forms. Extensive experiments show that our model outperforms baseline models on processing classical and modern Chinese and achieves significant and consistent improvements. Also, the results of ablation experiments demonstrate the effectiveness of each module.

Cite

CITATION STYLE

APA

Ji, Z., Wang, X., Shen, Y., & Rao, G. (2021). CANCN-BERT: A Joint Pre-Trained Language Model for Classical and Modern Chinese. In International Conference on Information and Knowledge Management, Proceedings (pp. 3112–3116). Association for Computing Machinery. https://doi.org/10.1145/3459637.3482068

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free