Abstract
A challenge of the computer vision community is to understand the semantics of an image that will allow for higher quality image generation based on existing high-level features and better analysis of (semi-) labeled datasets. Categorical labels aggregate a huge amount of information into a binary value which conceals valuable high-level concepts from the Machine Learning models. Towards addressing this challenge, this paper introduces a method, called Occlusion-based Latent Representations (OLR), for converting image labels to meaningful representations that capture a significant amount of data semantics. Besides being informationrich, these representations compose a disentangled low-dimensional latent space where each image label is encoded into a separate vector. We evaluate the quality of these representations in a series of experiments whose results suggest that the proposed model can capture data concepts and discover data interrelations.
Author supplied keywords
Cite
CITATION STYLE
Karatsiolis, S., & Kamilaris, A. (2021). Converting Image Labels to Meaningful and Information-rich Embeddings. In International Conference on Pattern Recognition Applications and Methods (Vol. 1, pp. 107–119). Science and Technology Publications, Lda. https://doi.org/10.5220/0010375801070119
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.