Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

Kibok Lee; Hao Yang; Satyaki Chakraborty; Zhaowei Cai; Gurumurthy Swaminathan; Avinash Ravichandran; Onkar Dabeer

Conference Proceedings

Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13680 LNCS 366-382

DOI: 10.1007/978-3-031-20044-1_21

2Citations

27Readers

Get full text

Abstract

Most existing works on few-shot object detection (FSOD) focus on a setting where both pre-training and few-shot learning datasets are from a similar domain. However, few-shot algorithms are important in multiple domains; hence evaluation needs to reflect the broad applications. We propose a Multi-dOmain Few-Shot Object Detection (MoFSOD) benchmark consisting of 10 datasets from a wide range of domains to evaluate FSOD algorithms. We comprehensively analyze the impacts of freezing layers, different architectures, and different pre-training datasets on FSOD performance. Our empirical results show several key factors that have not been explored in previous works: 1) contrary to previous belief, on a multi-domain benchmark, fine-tuning (FT) is a strong baseline for FSOD, performing on par or better than the state-of-the-art (SOTA) algorithms; 2) utilizing FT as the baseline allows us to explore multiple architectures, and we found them to have a significant impact on down-stream few-shot tasks, even with similar pre-training performances; 3) by decoupling pre-training and few-shot learning, MoFSOD allows us to explore the impact of different pre-training datasets, and the right choice can boost the performance of the down-stream tasks significantly. Based on these findings, we list possible avenues of investigation for improving FSOD performance and propose two simple modifications to existing algorithms that lead to SOTA performance on the MoFSOD benchmark. The code is available here.

Author supplied keywords

Cite

CITATION STYLE

APA

Lee, K., Yang, H., Chakraborty, S., Cai, Z., Swaminathan, G., Ravichandran, A., & Dabeer, O. (2022). Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13680 LNCS, pp. 366–382). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-20044-1_21

Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

Abstract

Author supplied keywords

Cite

Register to see more suggestions