Converting Image Labels to Meaningful and Information-rich Embeddings

0Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A challenge of the computer vision community is to understand the semantics of an image that will allow for higher quality image generation based on existing high-level features and better analysis of (semi-) labeled datasets. Categorical labels aggregate a huge amount of information into a binary value which conceals valuable high-level concepts from the Machine Learning models. Towards addressing this challenge, this paper introduces a method, called Occlusion-based Latent Representations (OLR), for converting image labels to meaningful representations that capture a significant amount of data semantics. Besides being informationrich, these representations compose a disentangled low-dimensional latent space where each image label is encoded into a separate vector. We evaluate the quality of these representations in a series of experiments whose results suggest that the proposed model can capture data concepts and discover data interrelations.

Cite

CITATION STYLE

APA

Karatsiolis, S., & Kamilaris, A. (2021). Converting Image Labels to Meaningful and Information-rich Embeddings. In International Conference on Pattern Recognition Applications and Methods (Vol. 1, pp. 107–119). Science and Technology Publications, Lda. https://doi.org/10.5220/0010375801070119

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free