VidVRD 2021: The Third Grand Challenge on Video Relation Detection

16Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

ACM Multimedia 2021 Video Relation Understanding Challenge is the third grand challenge which aims at exploring the relationship of subjects and objects appearing in videos for fine-grained and high-level video understanding. Given a video, the video relation detection model should output a serious of relation triplet subject, predicate, object and the corresponding trajectories of subject and object. The goal of this task is to promote research on developing video semantic understanding model, so as to perform complex inferences and mining of visual knowledge in videos. In this paper, we make a comprehensive and detailed introduction of this task, conclude the proposed algorithms in the last few years, and propose future direction for research in this task.

Cite

CITATION STYLE

APA

Ji, W., Li, Y., Wei, M., Shang, X., Xiao, J., Ren, T., & Chua, T. S. (2021). VidVRD 2021: The Third Grand Challenge on Video Relation Detection. In MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia (pp. 4779–4783). Association for Computing Machinery, Inc. https://doi.org/10.1145/3474085.3479232

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free