The Focus-Aspect-Value model for explainable prediction of subjective visual interpretation

Tushar Karayil; Philipp Blandfort; Jörn Hees; Andreas Dengel

Conference ProceedingsOPEN ACCESS

The Focus-Aspect-Value model for explainable prediction of subjective visual interpretation

ICMR 2019 - Proceedings of the 2019 ACM International Conference on Multimedia Retrieval (2019) 16-24

DOI: 10.1145/3323873.3325026

3Citations

6Readers

Abstract

Subjective visual interpretation is a challenging yet important topic in computer vision. Many approaches reduce this problem to the prediction of adjective- or attribute-labels from images. However, most of these do not take attribute semantics into account, or only process the image in a holistic manner. Furthermore, there is a lack of relevant datasets with fine-grained subjective labels. In this paper, we propose the Focus-Aspect-Value (FAV) model to structure the process of capturing subjectivity in image processing, and introduce a novel dataset following this way of modeling. We run experiments on this dataset to compare several deep learning methods and find that incorporating context information based on tensor multiplication outperforms the default way of information fusion (concatenation).

Cite

CITATION STYLE

APA

Karayil, T., Blandfort, P., Hees, J., & Dengel, A. (2019). The Focus-Aspect-Value model for explainable prediction of subjective visual interpretation. In ICMR 2019 - Proceedings of the 2019 ACM International Conference on Multimedia Retrieval (pp. 16–24). Association for Computing Machinery, Inc. https://doi.org/10.1145/3323873.3325026

The Focus-Aspect-Value model for explainable prediction of subjective visual interpretation

Abstract

Cite

Register to see more suggestions