Abstract
Pre-trained BERT contextualized representations have achieved state-of-the-art results on multiple downstream NLP tasks when fine-tuned with task-specific data. While much attention has been devoted to task-specific fine-tuning, there has been comparatively little work on improving the pre-trained representations themselves. In this paper, we explore ways of improving the pre-trained contextual representations for automatic short answer grading, a critical component of intelligent tutoring systems. We show that the pre-trained BERT model can be improved by augmenting its pre-training data with domain-specific resources such as textbooks. We also present a new approach that uses labeled short answer grading data to further enhance the language model. Empirical evaluation on multi-domain datasets shows that task-specific fine-tuning of the enhanced pre-trained language model achieves superior performance for short answer grading.
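To make the domain-adaptation step concrete, the following is a minimal sketch of continuing BERT's masked-language-model pre-training on domain text such as a textbook corpus, using the Hugging Face transformers and datasets libraries. This is not the authors' exact recipe (the abstract does not specify one); the corpus path, checkpoint names, and hyperparameters are illustrative assumptions.

```python
# Sketch: domain-adaptive MLM pre-training of BERT before task fine-tuning.
# Assumptions: "textbook_corpus.txt" is a plain-text file of in-domain
# sentences; hyperparameters are placeholders, not the paper's settings.
from datasets import load_dataset
from transformers import (
    BertForMaskedLM,
    BertTokenizerFast,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

# Load the domain corpus (e.g., text extracted from textbooks).
dataset = load_dataset("text", data_files={"train": "textbook_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

# Randomly mask 15% of tokens, as in standard BERT pre-training.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="bert-domain-adapted",
    num_train_epochs=1,
    per_device_train_batch_size=16,
)

Trainer(
    model=model,
    args=args,
    data_collator=collator,
    train_dataset=tokenized,
).train()

# The saved checkpoint would then be fine-tuned on labeled grading data.
model.save_pretrained("bert-domain-adapted")
```

The design choice here mirrors the abstract's pipeline: adapt the language model to the domain first, then apply task-specific fine-tuning on the adapted checkpoint rather than on vanilla BERT.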
Citation
Sung, C., Ma, T., Dhamecha, T. I., Reddy, V., Saha, S., & Arora, R. (2019). Pre-training BERT on domain resources for short answer grading. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 6071–6075). Association for Computational Linguistics. https://doi.org/10.18653/v1/D19-1628