DD-TIG at SemEval-2022 Task 5: Investigating the Relationships Between Multimodal and Unimodal Information in Misogynous Memes Detection and Classification

Ziming Zhou; Han Zhao; Jingjing Dong; Ning Ding; Xiaolong Liu; Kangli Zhang

Conference Proceedings (Open Access)

SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop (2022) 563-570

DOI: 10.18653/v1/2022.semeval-1.77

5 Citations

25 Readers

Abstract

This paper describes our submission for Task 5, Multimedia Automatic Misogyny Identification (MAMI), at SemEval-2022. The task is to detect and classify misogynous memes. To use both the textual and the visual information in a meme, we investigate several of the most recent transformer-based vision-language multimodal models and choose ERNIE-ViL-Large as our base model. For subtask A, having observed that models overfit on unimodal patterns, we propose strategies to mitigate the problems of biased words and template memes. For subtask B, we transform the multi-label problem into a multi-class one and experiment with oversampling and complementary techniques. Our approach placed 2nd in subtask A and 5th in subtask B of the competition.
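
The abstract does not say how the multi-label problem is turned into a multi-class one or how oversampling is applied. The following is only a minimal sketch of one common reading: it assumes a label-powerset mapping (each distinct label combination becomes one class) followed by random duplication of minority-class samples. The function names, the two-label toy vectors, and the `meme_i` identifiers are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def to_multiclass(label_vectors):
    """Map each distinct label combination to one class id (label powerset).

    Assumption: this is one possible transformation, not necessarily the
    one used in the paper.
    """
    combo_to_class = {}
    class_ids = []
    for vec in label_vectors:
        key = tuple(vec)
        if key not in combo_to_class:
            combo_to_class[key] = len(combo_to_class)
        class_ids.append(combo_to_class[key])
    return class_ids, combo_to_class


def oversample(samples, class_ids, seed=0):
    """Randomly duplicate samples until every class matches the largest one."""
    rng = random.Random(seed)
    by_class = {}
    for sample, cls in zip(samples, class_ids):
        by_class.setdefault(cls, []).append(sample)
    target = max(len(items) for items in by_class.values())
    out_samples, out_classes = [], []
    for cls, items in by_class.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for sample in items + extra:
            out_samples.append(sample)
            out_classes.append(cls)
    return out_samples, out_classes


# Hypothetical toy data: two binary labels per meme.
labels = [[0, 0], [1, 0], [1, 0], [1, 1], [0, 0], [0, 0]]
memes = [f"meme_{i}" for i in range(len(labels))]

class_ids, mapping = to_multiclass(labels)
balanced_memes, balanced_classes = oversample(memes, class_ids)
print(Counter(class_ids))
print(Counter(balanced_classes))
```

In this toy run the three label combinations become three classes with 3, 2, and 1 samples, and oversampling brings each class to 3 samples. With many labels the number of combinations grows quickly, which is one reason rare combinations may need oversampling in the first place.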

Cite

Zhou, Z., Zhao, H., Dong, J., Ding, N., Liu, X., & Zhang, K. (2022). DD-TIG at SemEval-2022 Task 5: Investigating the Relationships Between Multimodal and Unimodal Information in Misogynous Memes Detection and Classification. In SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop (pp. 563–570). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.semeval-1.77
