In this paper, we propose an end-to-end Attention-Block network for image retrieval (ABIR), which improves retrieval accuracy without requiring human annotations such as bounding boxes. Specifically, our network uses coarse-scale feature fusion, which generates attentive local features by combining information from different intermediate layers. Two attention blocks then extract detailed feature information. Extensive experiments on four public image retrieval datasets show that our method outperforms the state of the art by a significant margin.
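To make the idea of fusing intermediate-layer features and pooling them with attention more concrete, the following is a minimal NumPy sketch. It is not the ABIR architecture: the abstract does not specify the layers, the fusion operator, or the design of the two attention blocks, so everything here is an assumption for illustration. The function names (`upsample_nearest`, `fuse`, `spatial_attention`, `describe`), the nearest-neighbour upsampling, the channel concatenation, and the single softmax spatial attention with a projection vector `w` are all hypothetical choices.

```python
import numpy as np


def upsample_nearest(feat, out_h, out_w):
    """Nearest-neighbour resize of a (C, H, W) feature map (assumed fusion step)."""
    _, h, w = feat.shape
    rows = np.arange(out_h) * h // out_h  # source row for each output row
    cols = np.arange(out_w) * w // out_w  # source column for each output column
    return feat[:, rows[:, None], cols[None, :]]


def fuse(feats):
    """Concatenate feature maps from several layers at the finest resolution."""
    out_h = max(f.shape[1] for f in feats)
    out_w = max(f.shape[2] for f in feats)
    resized = [upsample_nearest(f, out_h, out_w) for f in feats]
    return np.concatenate(resized, axis=0)  # (sum of channels, out_h, out_w)


def spatial_attention(feat, w):
    """Softmax over spatial positions of a 1x1 projection; w has shape (C,)."""
    scores = np.tensordot(w, feat, axes=([0], [0]))  # (H, W)
    weights = np.exp(scores - scores.max())  # subtract max for numerical stability
    return weights / weights.sum()


def describe(feats, w):
    """Fuse layers, attention-pool them, and L2-normalise the descriptor."""
    fused = fuse(feats)
    attn = spatial_attention(fused, w)
    desc = (fused * attn[None]).sum(axis=(1, 2))
    norm = np.linalg.norm(desc)
    return desc / norm if norm > 0 else desc


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-ins for activations from a shallow and a deep layer of a CNN.
    shallow = rng.standard_normal((4, 8, 8))
    deep = rng.standard_normal((6, 4, 4))
    w = rng.standard_normal(10)  # in a real model this would be learned
    query = describe([shallow, deep], w)
    print(query.shape)
```

With descriptors normalised this way, retrieval reduces to ranking database images by the dot product between their descriptors and the query descriptor. In a trained network the projection `w` and the backbone would be learned end to end from image-level labels only, which is what makes the setting weakly supervised.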
Nie, X., Lu, H., Wang, Z., Liu, J., & Guo, Z. (2019). Weakly supervised image retrieval via coarse-scale feature fusion and multi-level attention blocks. In ICMR 2019 - Proceedings of the 2019 ACM International Conference on Multimedia Retrieval (pp. 48–52). Association for Computing Machinery, Inc. https://doi.org/10.1145/3323873.3325017