Sparsely aggregated convolutional networks

Abstract

We explore a key architectural aspect of deep convolutional neural networks: the pattern of internal skip connections used to aggregate outputs of earlier layers for consumption by deeper layers. Such aggregation is critical to facilitate training of very deep networks in an end-to-end manner. This is a primary reason for the widespread adoption of residual networks, which aggregate outputs via cumulative summation. While subsequent works investigate alternative aggregation operations (e.g. concatenation), we focus on an orthogonal question: which outputs to aggregate at a particular point in the network. We propose a new internal connection structure which aggregates only a sparse set of previous outputs at any given depth. Our experiments demonstrate this simple design change offers superior performance with fewer parameters and lower computational requirements. Moreover, we show that sparse aggregation allows networks to scale more robustly to 1000+ layers, thereby opening future avenues for training long-running visual processes.
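To make the idea concrete, here is a minimal sketch (not the authors' released code) of sparse aggregation by concatenation, assuming one simple sparse pattern: layer i consumes the outputs of layers i-1, i-2, i-4, i-8, and so on, so each layer aggregates only O(log i) earlier outputs instead of all of them. The names SparseAggregationNet and sparse_offsets are illustrative, not from the paper.

```python
# Sketch of sparse aggregation with power-of-two offsets (an assumed pattern).
import torch
import torch.nn as nn


def sparse_offsets(i):
    """Indices of the earlier outputs aggregated at depth i: i-1, i-2, i-4, ..."""
    offsets, k = [], 1
    while k <= i:
        offsets.append(i - k)
        k *= 2
    return offsets


class SparseAggregationNet(nn.Module):
    """Toy conv stack; each layer consumes a sparse concatenation of earlier outputs."""

    def __init__(self, depth=8, channels=16):
        super().__init__()
        self.stem = nn.Conv2d(3, channels, 3, padding=1)
        self.layers = nn.ModuleList()
        for i in range(1, depth + 1):
            # Input width grows only logarithmically with depth.
            in_ch = channels * len(sparse_offsets(i))
            self.layers.append(nn.Sequential(
                nn.BatchNorm2d(in_ch),
                nn.ReLU(inplace=True),
                nn.Conv2d(in_ch, channels, 3, padding=1)))

    def forward(self, x):
        outputs = [self.stem(x)]
        for i, layer in enumerate(self.layers, start=1):
            # Concatenate only the sparsely selected earlier outputs along channels.
            agg = torch.cat([outputs[j] for j in sparse_offsets(i)], dim=1)
            outputs.append(layer(agg))
        return outputs[-1]


if __name__ == "__main__":
    net = SparseAggregationNet()
    print(net(torch.randn(1, 3, 32, 32)).shape)  # torch.Size([1, 16, 32, 32])
```

Summation (as in residual networks) could be substituted for concatenation in the aggregation step; the point of the sketch is only which earlier outputs are selected, not how they are combined.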

Citation (APA)

Zhu, L., Deng, R., Maire, M., Deng, Z., Mori, G., & Tan, P. (2018). Sparsely aggregated convolutional networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11216 LNCS, pp. 192–208). Springer Verlag. https://doi.org/10.1007/978-3-030-01258-8_12
