Abstract
Understanding image advertisements is a challenging task, often requiring non-literal interpretation. We argue that standard image-based predictions are insufficient for symbolism prediction. Following the intuition that texts and images are complementary in advertising, we introduce a multimodal ensemble of a state of the art image-based classifier, a classifier based on an object detection architecture, and a fine-tuned language model applied to texts extracted from ads by OCR. The resulting system establishes a new state of the art in symbolism prediction.
Cite
CITATION STYLE
Savchenko, A., Alekseev, A., Kwon, S., Tutubalina, E., Miasnikov, E., & Nikolenko, S. (2020). Ad Lingua: Text Classification Improves Symbolism Prediction in Image Advertisements. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 1886–1892). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.171
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.