Abstract
A large body of physiological findings has suggested the vision system understands a scene in terms of its local features such as lines and curves. A highly notable computer algorithm developed that models such behaviour is the Convolutional Neural Network (CNN). Whilst recognising an object in various scales remains trivial for the human vision system, CNNs struggle to achieve the same behaviour. Recent physiological findings are suggesting two new paradigms. Firstly, the visual system uses both local and global features in its recognition function. Secondly, the brain uses a distributed processing architecture to learn information from multiple modalities. In this paper we combine these paradigms and propose a distributed information integration model called D-Net to improve scale-invariant classification of images. We use a CNN to extract local features and, inspired by Google’s INCEPTION model, develop a trainable method using filter pyramids to extract global features called Filter Pyramid Convolutions (FPC). D-Net locally processes CNN and FPC features, fuses the outcomes and obtains a global estimate via the central processor. We test D-Net on classification of scaled images on benchmark datasets. Our results show D-Net’s potential effectiveness towards classification of scaled images.
Author supplied keywords
Cite
CITATION STYLE
Kumar, D., & Sharma, D. (2020). Distributed information integration in convolutional neural networks. In VISIGRAPP 2020 - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (Vol. 5, pp. 491–498). SciTePress. https://doi.org/10.5220/0009150404910498
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.