VLAD is not necessary for CNN

Dan Yu; Xiao Jun Wu

Conference ProceedingsOPEN ACCESS

VLAD is not necessary for CNN

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9915 LNCS 492-499

DOI: 10.1007/978-3-319-49409-8_41

2Citations

5Readers

Abstract

Global convolutional neural networks (CNNs) activations lack geometric invariance, and in order to address this problem, Gong et al. proposed multi-scale orderless pooling(MOP-CNN), which extracts CNN activations for local patches at multiple scale levels, and performs orderless VLAD pooling to extract features. However, we find that this method can improve the performance mainly because it extracts global and local representation simultaneously, and VLAD pooling is not necessary as the representations extracted by CNN is good enough for classification. In this paper, we propose a new method to extract multi-scale features of CNNs, leading to a new structure of deep learning. The method extracts CNN representations for local patches at multiple scale levels, then concatenates all the representations at each level separately, finally, concatenates the results of all levels. The CNN is trained on the ImageNet dataset to extract features and it is then transferred to other datasets. The experimental results obtained on the databases MITIndoor and Caltech-101 show that the performance of our proposed method is superior to the MOP-CNN.

Author supplied keywords

Cite

CITATION STYLE

APA

Yu, D., & Wu, X. J. (2016). VLAD is not necessary for CNN. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9915 LNCS, pp. 492–499). Springer Verlag. https://doi.org/10.1007/978-3-319-49409-8_41

VLAD is not necessary for CNN

Abstract

Author supplied keywords

Cite

Register to see more suggestions