Multiple data augmentation strategies for improving performance on automatic short answer scoring

Jiaqi Lun; Jia Zhu; Yong Tang; Min Yang

Conference Proceedings, Open Access

AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (2020) 13446-13453

DOI: 10.1609/aaai.v34i09.7062

Abstract

Automatic short answer scoring (ASAS) is a research subject in intelligent education, an active area of natural language understanding. Many experiments have shown that ASAS systems still fall short because their performance is limited by the training data. To address this problem, we propose MDA-ASAS, multiple data augmentation strategies for improving performance on automatic short answer scoring. MDA-ASAS is designed to learn language representations enhanced by data augmentation strategies, which include back-translation, using correct answers as reference answers, and swapping content. We argue that external knowledge has a profound impact on the ASAS process. Meanwhile, the Bidirectional Encoder Representations from Transformers (BERT) model has been shown to be effective in improving many natural language processing tasks. BERT acquires semantic, grammatical, and other features from large amounts of unlabeled data, and in doing so effectively adds external knowledge. Combined with the latest BERT model, MDA-ASAS brings a significant gain over the state of the art in our experiments on the ASAS dataset. We also perform extensive ablation studies and suggest parameters for practical use.
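
The three augmentation strategies named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the details, so the data layout (a list of `(reference_answer, student_answer, label)` tuples for a single question), the label string `"correct"`, the reading of "swap content" as exchanging the two answers in a pair, and the `translate` callable (a stand-in for whatever machine translation system is available) are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

# Hypothetical pair format: (reference_answer, student_answer, label).
# All examples in one list are assumed to belong to the same question.
Example = Tuple[str, str, str]


def back_translate(
    examples: List[Example],
    translate: Callable[[str, str, str], str],  # assumed signature: (text, src, tgt)
    pivot: str = "zh",
) -> List[Example]:
    """Paraphrase each student answer by translating it to a pivot language and back."""
    augmented = []
    for reference, student, label in examples:
        pivot_text = translate(student, "en", pivot)
        paraphrase = translate(pivot_text, pivot, "en")
        # Keep only paraphrases that differ from the original answer.
        if paraphrase != student:
            augmented.append((reference, paraphrase, label))
    return augmented


def correct_answers_as_references(
    examples: List[Example], correct_label: str = "correct"
) -> List[Example]:
    """Reuse student answers labeled correct as additional reference answers."""
    extra_refs = [s for _, s, label in examples if label == correct_label]
    augmented = []
    for new_reference in extra_refs:
        for _, student, label in examples:
            # Do not pair an answer with itself.
            if student != new_reference:
                augmented.append((new_reference, student, label))
    return augmented


def swap_content(examples: List[Example]) -> List[Example]:
    """Exchange the positions of the two answers in each pair (one assumed reading)."""
    return [(student, reference, label) for reference, student, label in examples]
```

Under these assumptions, the outputs of the three functions would be concatenated with the original pairs before fine-tuning a sentence-pair classifier such as BERT. Whether to apply the strategies independently or to chain them is a design choice the abstract leaves open.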

Cite

Lun, J., Zhu, J., Tang, Y., & Yang, M. (2020). Multiple data augmentation strategies for improving performance on automatic short answer scoring. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 13446–13453). AAAI Press. https://doi.org/10.1609/aaai.v34i09.7062
