The social media prediction task is aiming at predicting content popularity which includes social multimedia data such as photos, videos, and news. The task can not only help make better decisions for recommendation, but also reveals the public attention from evolutionary social systems. In this paper, we propose a novel approach named curriculum learning for wide multimedia-based transformer with graph target detection(CL-WMTG). The curriculum learning is designed for the transformer to improve the efficiency of model convergence. The mechanism of wide multimedia-based transformer is to make the model capable of learning cross information from text, pictures and other features(e.g. categories, location). Moreover, the graph target detection part can extract different features in the picture by pretrained model and reconstruct the features with a homogeneous graph network. We achieved third place in the SMP Challenge 2020.
CITATION STYLE
Chen, W., Hong, F., Huang, C., Zhang, S., Wang, R., Xie, R., … Wang, Y. (2020). Curriculum Learning for Wide Multimedia-Based Transformer with Graph Target Detection. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 4575–4579). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3416275
Mendeley helps you to discover research relevant for your work.