Multi-turn video question answering via multi-stream hierarchical attention context network

44Citations
Citations of this article
37Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Conversational video question answering is a challenging task in visual information retrieval, which generates the accurate answer from the referenced video contents according to the visual conversation context and given question. However, the existing visual question answering methods mainly tackle the problem of single-turn video question answering, which may be ineffectively applied for multiturn video question answering directly, due to the insufficiency of modeling the sequential conversation context. In this paper, we study the problem of multi-turn video question answering from the viewpoint of multi-step hierarchical attention context network learning. We first propose the hierarchical attention context network for context-aware question understanding by modeling the hierarchically sequential conversation context structure. We then develop the multi-stream spatio-temporal attention network for learning the joint representation of the dynamic video contents and context-aware question embedding. We next devise the hierarchical attention context network learning method with multi-step reasoning process for multi-turn video question answering. We construct two large-scale multi-turn video question answering datasets. The extensive experiments show the effectiveness of our method.

Cite

CITATION STYLE

APA

Zhao, Z., Jiang, X., Cai, D., Xiao, J., He, X., & Pu, S. (2018). Multi-turn video question answering via multi-stream hierarchical attention context network. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 3690–3696). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/513

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free