Leveraging EfficientNet and contrastive learning for accurate global-scale location estimation

11Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we address the problem of global-scale image geolocation, proposing a mixed classification-retrieval scheme. Unlike other methods that strictly tackle the problem as a classification or retrieval task, we combine the two practices in a unified solution leveraging the advantages of each approach with two different modules. The first leverages the EfficientNet architecture to assign images to a specific geographic cell in a robust way. The second introduces a new residual architecture that is trained with contrastive learning to map input images to an embedding space that minimizes the pairwise geodesic distance of same-location images. For the final location estimation, the two modules are combined with a search-within-cell scheme, where the locations of most similar images from the predicted geographic cell are aggregated based on a spatial clustering scheme. Our approach demonstrates very competitive performance on four public datasets, achieving new state-of-the-art performance in fine granularity scales, i.e., 15.0% at 1km range on Im2GPS3k.

Cite

CITATION STYLE

APA

Kordopatis-Zilos, G., Galopoulos, P., Papadopoulos, S., & Kompatsiaris, I. (2021). Leveraging EfficientNet and contrastive learning for accurate global-scale location estimation. In ICMR 2021 - Proceedings of the 2021 International Conference on Multimedia Retrieval (pp. 155–163). Association for Computing Machinery, Inc. https://doi.org/10.1145/3460426.3463644

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free