Convolutional neural networks for image processing: An application in robot vision

Matthew Browne; Saeed Shiry Ghidary

Conference Proceedings

Convolutional neural networks for image processing: An application in robot vision

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2903 641-652

DOI: 10.1007/978-3-540-24581-0_55

66Citations

130Readers

Get full text

Abstract

Convolutional neural networks (CNNs) represent an interesting method for adaptive image processing, and form a link between general feed-forward neural networks and adaptive filters. Two dimensional CNNs are formed by one or more layers of two dimensional filters, with possible non-linear activation functions and/or down-sampling. CNNs possess key properties of translation invariance and spatially local connections (receptive fields). We present a description of the convolutional network architecture, and an application to practical image processing on a mobile robot. A CNN is used to detect and characterize cracks on an autonomous sewer inspection robot. The filter sizes used in all cases were 4x4, with non-linear activations between each layer. The number of feature maps used in the three hidden layers was, from input to output, 4, 4, 4. The network was trained using a dataset of 48x48 sub-regions drawn from 30 still image 320x240 pixel frames sampled from a prerecorded sewer pipe inspection video. 15 frames were used for training and 15 for validation of network performance. Although development of a CNN system for civil use is on-going, the results support the notion that data-based adaptive image processing methods such as CNNs are useful for image processing, or other applications where the input arrays are large, and spatially/temporally distributed. Further refinements of the CNN architecture, such as the implementation of separable filters, or extensions to three dimensional (ie. video) processing, are suggested.

Cite

CITATION STYLE

APA

Browne, M., & Ghidary, S. S. (2003). Convolutional neural networks for image processing: An application in robot vision. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2903, pp. 641–652). Springer Verlag. https://doi.org/10.1007/978-3-540-24581-0_55

Convolutional neural networks for image processing: An application in robot vision

Abstract

Cite

Register to see more suggestions