A question type driven framework to diversify visual question generation

46Citations
Citations of this article
53Readers
Mendeley users who have this article in their library.

Abstract

Visual question generation aims at asking questions about an image automatically. Existing research works on this topic usually generate a single question for each given image without considering the issue of diversity. In this paper, we propose a question type driven framework to produce multiple questions for a given image with different focuses. In our framework, each question is constructed following the guidance of a sampled question type in a sequence-to-sequence fashion. To diversify the generated questions, a novel conditional variational auto-encoder is introduced to generate multiple questions with a specific question type. Moreover, we design a strategy to conduct the question type distribution learning for each image to select the final questions. Experimental results on three benchmark datasets show that our framework outperforms the state-of-the-art approaches in terms of both relevance and diversity.

Cite

CITATION STYLE

APA

Fan, Z., Wei, Z., Li, P., Lan, Y., & Huang, X. (2018). A question type driven framework to diversify visual question generation. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 4048–4054). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/563

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free