Unsupervised question answering (UQA) has been proposed to avoid the high cost of creating high-quality datasets for QA. One approach to UQA is to train a QA model with questions generated automatically. However, the generated questions are either too similar to a word sequence in the context or too drifted from the semantics of the context, thereby making it difficult to train a robust QA model. We propose a novel regularization method based on teacher-student architecture to avoid bias toward a particular question generation strategy and modulate the process of generating individual words when a question is generated. Our experiments demonstrate that we have achieved the goal of generating higher-quality questions for UQA across diverse QA datasets and tasks. We also show that this method can be useful for creating a QA model with few-shot learning.
CITATION STYLE
Kang, J., Hong, G., San Roman, H. P., & Myaeng, S. H. (2020). Regularization of distinct strategies for unsupervised question generation. In Findings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020 (pp. 3266–3277). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.findings-emnlp.293
Mendeley helps you to discover research relevant for your work.