Converting Image Labels to Meaningful and Information-rich Embeddings

Savvas Karatsiolis; Andreas Kamilaris

Conference ProceedingsOPEN ACCESS

Converting Image Labels to Meaningful and Information-rich Embeddings

International Conference on Pattern Recognition Applications and Methods (2021) 1 107-119

DOI: 10.5220/0010375801070119

0Citations

8Readers

Get full text

Abstract

A challenge of the computer vision community is to understand the semantics of an image that will allow for higher quality image generation based on existing high-level features and better analysis of (semi-) labeled datasets. Categorical labels aggregate a huge amount of information into a binary value which conceals valuable high-level concepts from the Machine Learning models. Towards addressing this challenge, this paper introduces a method, called Occlusion-based Latent Representations (OLR), for converting image labels to meaningful representations that capture a significant amount of data semantics. Besides being informationrich, these representations compose a disentangled low-dimensional latent space where each image label is encoded into a separate vector. We evaluate the quality of these representations in a series of experiments whose results suggest that the proposed model can capture data concepts and discover data interrelations.

Author supplied keywords

Cite

CITATION STYLE

APA

Karatsiolis, S., & Kamilaris, A. (2021). Converting Image Labels to Meaningful and Information-rich Embeddings. In International Conference on Pattern Recognition Applications and Methods (Vol. 1, pp. 107–119). Science and Technology Publications, Lda. https://doi.org/10.5220/0010375801070119

Converting Image Labels to Meaningful and Information-rich Embeddings

Abstract

Author supplied keywords

Cite

Register to see more suggestions