Visual question generation aims at asking questions about an image automatically. Existing research works on this topic usually generate a single question for each given image without considering the issue of diversity. In this paper, we propose a question type driven framework to produce multiple questions for a given image with different focuses. In our framework, each question is constructed following the guidance of a sampled question type in a sequence-to-sequence fashion. To diversify the generated questions, a novel conditional variational auto-encoder is introduced to generate multiple questions with a specific question type. Moreover, we design a strategy to conduct the question type distribution learning for each image to select the final questions. Experimental results on three benchmark datasets show that our framework outperforms the state-of-the-art approaches in terms of both relevance and diversity.
CITATION STYLE
Fan, Z., Wei, Z., Li, P., Lan, Y., & Huang, X. (2018). A question type driven framework to diversify visual question generation. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 4048–4054). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/563
Mendeley helps you to discover research relevant for your work.