Abstract
The ACM Multimedia 2021 Video Relation Understanding Challenge is the third grand challenge that aims to explore the relationships between subjects and objects appearing in videos for fine-grained, high-level video understanding. Given a video, a video relation detection model should output a series of relation triplets ⟨subject, predicate, object⟩ together with the corresponding trajectories of the subject and object. The goal of this task is to promote research on video semantic understanding models that can perform complex inference and mine visual knowledge from videos. In this paper, we give a comprehensive and detailed introduction to the task, summarize the algorithms proposed in the last few years, and suggest future directions for research on this task.
Cite
Ji, W., Li, Y., Wei, M., Shang, X., Xiao, J., Ren, T., & Chua, T. S. (2021). VidVRD 2021: The Third Grand Challenge on Video Relation Detection. In MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia (pp. 4779–4783). Association for Computing Machinery, Inc. https://doi.org/10.1145/3474085.3479232