CopyBERT: A unified approach to question generation with self-attention

Abstract

Contextualized word embeddings provide better initialization for neural networks that deal with various natural language understanding (NLU) tasks, including question answering (QA) and, more recently, question generation (QG). Apart from providing meaningful word representations, pre-trained transformer models such as BERT also provide self-attentions that encode syntactic information and can be probed for dependency parsing and POS tagging. In this paper, we show that the information from the self-attentions of BERT is useful for language modeling of questions conditioned on paragraph and answer phrases. To control the attention span, we use a semi-diagonal mask and employ a shared model for encoding and decoding, unlike sequence-to-sequence architectures. We further apply a copy mechanism over the self-attentions to achieve state-of-the-art results for question generation on the SQuAD dataset.
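
The semi-diagonal mask is only described at a high level in the abstract. The sketch below shows one plausible reading of it, assuming a UniLM-style layout in which the paragraph-and-answer context attends bidirectionally to itself while question tokens attend to the full context and only causally to earlier question tokens; the function and variable names are illustrative and are not taken from the authors' code.

import torch

def semi_diagonal_mask(num_context: int, num_question: int) -> torch.Tensor:
    # Boolean (L, L) mask, True = attention allowed, for an input that
    # concatenates [paragraph + answer] context tokens with question tokens.
    total = num_context + num_question
    mask = torch.zeros(total, total, dtype=torch.bool)
    # Context tokens attend bidirectionally among themselves.
    mask[:num_context, :num_context] = True
    # Question tokens see the entire context ...
    mask[num_context:, :num_context] = True
    # ... and attend causally to question tokens up to and including themselves.
    mask[num_context:, num_context:] = torch.tril(
        torch.ones(num_question, num_question, dtype=torch.bool))
    return mask

# Example: 4 context tokens, 3 question tokens.
print(semi_diagonal_mask(4, 3).int())

Feeding such a mask to a single BERT model is one way a shared encoder/decoder setup of the kind mentioned in the abstract could be realized, since the causal block over the question positions permits autoregressive generation without a separate decoder stack.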

Cite (APA)

Varanasi, S., Amin, S., & Neumann, G. (2020). CopyBERT: A unified approach to question generation with self-attention. In Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI (pp. 25–31). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.nlp4convai-1.3
