Doubly convolutional neural networks

ISSN: 1049-5258

Abstract

Building large models with parameter sharing accounts for most of the success of deep convolutional neural networks (CNNs). In this paper, we propose doubly convolutional neural networks (DCNNs), which significantly improve the performance of CNNs by further exploiting this idea. Instead of allocating a set of convolutional filters that are independently learned, a DCNN maintains groups of filters where the filters within each group are translated versions of each other. In practice, a DCNN can be easily implemented by a two-step convolution procedure, which is supported by most modern deep learning libraries. We perform extensive experiments on three image classification benchmarks: CIFAR-10, CIFAR-100, and ImageNet, and show that DCNNs consistently outperform other competing architectures. We also verify that replacing a convolutional layer with a doubly convolutional layer at any depth of a CNN improves its performance. Moreover, we examine various design choices of DCNNs, showing that a DCNN can serve the dual purpose of building more accurate models and/or reducing the memory footprint without sacrificing accuracy.
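
To make the core idea concrete: each group of filters can be derived from a single larger "meta filter," with the group's effective filters taken as all spatially translated crops of it. Below is a minimal PyTorch sketch of one straightforward realization, which expands the crops into a filter bank and runs a single standard convolution; this produces the same output as the two-step convolution the abstract refers to, though it is not the authors' reference implementation. The module name DoublyConv2d and all parameter defaults are illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class DoublyConv2d(nn.Module):
    """Sketch of a doubly convolutional layer: each learned meta filter of
    size z' x z' is expanded into all of its z x z crops, so the effective
    filters within a group are translated versions of each other."""

    def __init__(self, in_channels, n_meta, meta_size=6, filter_size=3):
        super().__init__()
        assert meta_size >= filter_size
        self.filter_size = filter_size
        # One learnable meta filter per group (n_meta groups in total).
        self.meta = nn.Parameter(
            torch.randn(n_meta, in_channels, meta_size, meta_size) * 0.01
        )

    def forward(self, x):
        n_meta, c, zp, _ = self.meta.shape
        z = self.filter_size
        # Step 1: extract every z x z crop of each meta filter by unfolding
        # the two spatial dims; shape becomes (n_meta, c, k, k, z, z),
        # where k = z' - z + 1 crops per axis.
        crops = self.meta.unfold(2, z, 1).unfold(3, z, 1)
        k = zp - z + 1
        weight = crops.permute(0, 2, 3, 1, 4, 5).reshape(n_meta * k * k, c, z, z)
        # Step 2: a single standard convolution with the expanded filter
        # bank, yielding n_meta * k^2 output channels.
        return F.conv2d(x, weight, padding=z // 2)

For example, with 8 meta filters of size 6 x 6 and effective filters of size 3 x 3, each group contributes 16 translated filters:

layer = DoublyConv2d(in_channels=3, n_meta=8, meta_size=6, filter_size=3)
y = layer(torch.randn(1, 3, 32, 32))  # shape (1, 128, 32, 32): 8 groups x 16 crops

Only the 8 meta filters are learned, while the convolution behaves as if it had 128 filters, which illustrates the accuracy/memory trade-off mentioned in the abstract.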

Citation (APA)

Zhai, S., Cheng, Y., Lu, W., & Zhang, Z. (2016). Doubly convolutional neural networks. In Advances in Neural Information Processing Systems (pp. 1090–1098). Neural Information Processing Systems Foundation.
