Curriculum Learning for Wide Multimedia-Based Transformer with Graph Target Detection

8Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The social media prediction task is aiming at predicting content popularity which includes social multimedia data such as photos, videos, and news. The task can not only help make better decisions for recommendation, but also reveals the public attention from evolutionary social systems. In this paper, we propose a novel approach named curriculum learning for wide multimedia-based transformer with graph target detection(CL-WMTG). The curriculum learning is designed for the transformer to improve the efficiency of model convergence. The mechanism of wide multimedia-based transformer is to make the model capable of learning cross information from text, pictures and other features(e.g. categories, location). Moreover, the graph target detection part can extract different features in the picture by pretrained model and reconstruct the features with a homogeneous graph network. We achieved third place in the SMP Challenge 2020.

Cite

CITATION STYLE

APA

Chen, W., Hong, F., Huang, C., Zhang, S., Wang, R., Xie, R., … Wang, Y. (2020). Curriculum Learning for Wide Multimedia-Based Transformer with Graph Target Detection. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 4575–4579). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3416275

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free