Bridging between computer and robot vision through data augmentation: A case study on object recognition

13Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Despite the impressive progress brought by deep network in visual object recognition, robot vision is still far from being a solved problem. The most successful convolutional architectures are developed starting from ImageNet, a large scale collection of images of object categories downloaded from the Web. This kind of images is very different from the situated and embodied visual experience of robots deployed in unconstrained settings. To reduce the gap between these two visual experiences, this paper proposes a simple yet effective data augmentation layer that zooms on the object of interest and simulates the object detection outcome of a robot vision system. The layer, that can be used with any convolutional deep architecture, brings to an increase in object recognition performance of up to 7%, in experiments performed over three different benchmark databases. An implementation of our robot data augmentation layer has been made publicly available.

Cite

CITATION STYLE

APA

D’Innocente, A., Carlucci, F. M., Colosi, M., & Caputo, B. (2017). Bridging between computer and robot vision through data augmentation: A case study on object recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10528 LNCS, pp. 384–393). Springer Verlag. https://doi.org/10.1007/978-3-319-68345-4_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free