This paper addresses the task of zero-shot image classification. The key contribution of the proposed approach is to control the semantic embedding of images - one of the main ingredients of zero-shot learning - by formulating it as a metric learning problem. The optimized empirical criterion associates two types of sub-task constraints: metric discriminating capacity and accurate attribute prediction. This results in a novel expression of zero-shot learning not requiring the notion of class in the training phase: only pairs of image/attributes, augmented with a consistency indicator, are given as ground truth. At test time, the learned model can predict the consistency of a test image with a given set of attributes, allowing flexible ways to produce recognition inferences. Despite its simplicity, the proposed approach gives state-ofthe-art results on four challenging datasets used for zero-shot recognition evaluation.
CITATION STYLE
Bucher, M., Herbin, S., & Jurie, F. (2016). Improving semantic embedding consistency by metric learning for zero-shot classiffication. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9909 LNCS, pp. 730–746). Springer Verlag. https://doi.org/10.1007/978-3-319-46454-1_44
Mendeley helps you to discover research relevant for your work.