Contextualized word embeddings provide better initialization for neural networks that deal with various natural language understanding (NLU) tasks, including question answering (QA) and, more recently, question generation (QG). Beyond providing meaningful word representations, pre-trained transformer models such as BERT also expose self-attention weights that encode syntactic information and can be probed for dependency parsing and POS tagging. In this paper, we show that the information in BERT's self-attentions is useful for language modeling of questions conditioned on a paragraph and an answer phrase. To control the attention span, we use a semi-diagonal mask and share a single model for encoding and decoding, unlike sequence-to-sequence architectures. We further employ a copy mechanism over the self-attentions to achieve state-of-the-art results for question generation on the SQuAD dataset.
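The abstract does not give implementation details, but its two key ingredients can be sketched. First, one plausible reading of the semi-diagonal mask is a UniLM-style prefix mask: the paragraph-and-answer block attends bidirectionally to itself, while question positions attend to the full context and, causally, to earlier question positions. The function and variable names below (semi_diagonal_mask, context_len, question_len) are illustrative, not taken from the paper.

    import torch
    import torch.nn.functional as F

    def semi_diagonal_mask(context_len, question_len):
        """Boolean (L, L) mask, True where attention is allowed.
        Context tokens (paragraph + answer) attend bidirectionally to each
        other; question tokens attend to the full context and, causally,
        to earlier question positions."""
        total = context_len + question_len
        mask = torch.zeros(total, total, dtype=torch.bool)
        mask[:context_len, :context_len] = True                 # context <-> context
        mask[context_len:, :context_len] = True                 # question -> context
        mask[context_len:, context_len:] = torch.tril(          # question -> earlier question
            torch.ones(question_len, question_len)).bool()
        return mask

    # Example: 5 context tokens followed by 3 question tokens.
    print(semi_diagonal_mask(5, 3).int())

Second, a copy mechanism in the generic pointer-generator form (See et al., 2017). In CopyBERT the copy distribution would come from BERT's own self-attention heads, but the exact parametrisation is not stated in the abstract, so the sketch below is only an assumption about how such a mixture could be computed.

    def copy_augmented_distribution(vocab_logits, attn_weights, source_ids, p_copy):
        """Mix the generator distribution over the vocabulary with a copy
        distribution obtained by scattering (normalised) attention weights
        onto the vocabulary ids of the source tokens.
          vocab_logits: (batch, vocab_size)  next-token logits
          attn_weights: (batch, src_len)     attention over context tokens, sums to 1
          source_ids:   (batch, src_len)     vocabulary ids of the context tokens
          p_copy:       (batch, 1)           gate in [0, 1] favouring copying
        """
        gen_dist = F.softmax(vocab_logits, dim=-1)
        copy_dist = torch.zeros_like(gen_dist)
        copy_dist.scatter_add_(1, source_ids, attn_weights)
        return (1.0 - p_copy) * gen_dist + p_copy * copy_dist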
CITATION STYLE
Varanasi, S., Amin, S., & Neumann, G. (2020). CopyBERT: A unified approach to question generation with self-attention. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 25–31). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.nlp4convai-1.3