Neural architecture search for convolutional neural networks with attention

9 citations · 20 Mendeley readers

Abstract

The recent development of neural architecture search (NAS) has made it possible to automatically discover high-performing neural network architectures within a few days. Convolutional neural networks (CNNs) extract rich features by repeatedly applying standard operations (convolutions and pooling). However, these operations also extract useless or even harmful features. Attention mechanisms enable neural networks to discard irrelevant information and have achieved state-of-the-art performance. While a variety of attention mechanisms for CNNs have been proposed, current NAS methods have paid little attention to them. In this study, we propose a novel NAS method that searches for attention mechanisms as well as operations. We examined several patterns for arranging attention mechanisms and operations and found that attention works best when it has its own search space and is applied after the operations. We demonstrate the superior performance of our method in experiments on the CIFAR-10, CIFAR-100, and ImageNet datasets. The discovered architecture achieved lower classification error rates with fewer parameters than architectures found by existing NAS methods.
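To make the abstract's central design point concrete, the following is a minimal sketch of searching over attention mechanisms in addition to operations, assuming a DARTS-style differentiable search. The candidate sets, module names, and the choice of squeeze-and-excitation attention are illustrative assumptions, not the authors' actual code; the sketch only shows the arrangement the abstract reports as best (attention with its own architecture parameters, applied after the operation).

import torch
import torch.nn as nn
import torch.nn.functional as F

class SEAttention(nn.Module):
    """Squeeze-and-Excitation channel attention (one illustrative candidate)."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid(),
        )

    def forward(self, x):
        w = self.fc(x.mean(dim=(2, 3)))   # global average pool -> channel weights
        return x * w[:, :, None, None]    # reweight each channel

class MixedEdge(nn.Module):
    """Softmax-weighted sum over candidates, as in DARTS-style search."""
    def __init__(self, candidates):
        super().__init__()
        self.candidates = nn.ModuleList(candidates)
        # Architecture parameters for this edge's own search space.
        self.alpha = nn.Parameter(1e-3 * torch.randn(len(candidates)))

    def forward(self, x):
        weights = F.softmax(self.alpha, dim=0)
        return sum(w * op(x) for w, op in zip(weights, self.candidates))

channels = 16
ops = MixedEdge([                          # operation search space
    nn.Conv2d(channels, channels, 3, padding=1),
    nn.Conv2d(channels, channels, 5, padding=2),
    nn.MaxPool2d(3, stride=1, padding=1),
])
attn = MixedEdge([                         # separate attention search space
    SEAttention(channels),
    nn.Identity(),                         # "no attention" option
])

x = torch.randn(2, channels, 32, 32)
y = attn(ops(x))                           # attention follows the operation

Because the attention edge holds its own alpha parameters, the search can choose an attention mechanism (or none) independently of which operation it follows, which is the arrangement the abstract describes.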

Cite

APA: Nakai, K., Matsubara, T., & Uehara, K. (2021). Neural architecture search for convolutional neural networks with attention. IEICE Transactions on Information and Systems, E104D(2), 312–321. https://doi.org/10.1587/transinf.2020EDP7111
