A single shot framework with multi-scale feature fusion for geospatial object detection

Abstract

With the rapid advances in remote-sensing technologies and the growing number of satellite images, fast and effective object detection plays an important role in understanding and analyzing image information, with applications in both civilian and military fields. Recently, object detection methods based on region-based convolutional neural networks have shown excellent performance. However, these two-stage methods involve separate region proposal generation and object detection procedures, which results in low computation speed. In addition, because manual annotation is expensive, well-annotated aerial images are scarce, which limits the progress of geospatial object detection in remote sensing. In this paper, we first construct and release a large-scale remote-sensing dataset for geospatial object detection (RSD-GOD) that consists of 5 categories with 18,187 annotated images and 40,990 instances. Second, we design a single shot detection framework with multi-scale feature fusion. The feature maps from different layers are fused through up-sampling and concatenation blocks to predict the detection results. High-level features with semantic information and low-level features with fine details are both fully exploited for detection tasks, especially for small objects. Meanwhile, a soft non-maximum suppression strategy is applied to select the final detection results. Extensive experiments have been conducted on two datasets to evaluate the designed network. The proposed approach achieves a mean average precision of 89.0% on the newly constructed RSD-GOD dataset and 83.8% on the Northwestern Polytechnical University very high spatial resolution-10 (NWPU VHR-10) dataset, at 18 frames per second (FPS) on an NVIDIA GTX-1080Ti GPU.
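
The abstract names soft non-maximum suppression as the final selection step but does not say which variant or parameters the authors use. The following is a minimal sketch of the commonly described Gaussian variant, not the paper's implementation. The function names, the `(x1, y1, x2, y2)` box format, and the `sigma` and `score_threshold` defaults are all assumptions made for illustration.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian soft-NMS (illustrative; parameter values are assumptions).

    Returns a list of (box_index, rescored_confidence) pairs in the order
    the boxes were selected.
    """
    remaining = [[i, float(s)] for i, s in enumerate(scores)]
    kept = []
    while remaining:
        # Select the box with the highest current score.
        best = max(range(len(remaining)), key=lambda k: remaining[k][1])
        idx, score = remaining.pop(best)
        kept.append((idx, score))
        # Decay the scores of the other boxes according to their overlap
        # with the selected box, instead of discarding them outright.
        for item in remaining:
            overlap = iou(boxes[idx], boxes[item[0]])
            item[1] *= math.exp(-(overlap ** 2) / sigma)
        # Drop boxes whose score has decayed below the threshold.
        remaining = [it for it in remaining if it[1] >= score_threshold]
    return kept


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
    scores = [0.9, 0.8, 0.7]
    for index, score in soft_nms(boxes, scores):
        print(index, round(score, 3))
```

In this toy input the second box overlaps the first heavily, so it is kept with a reduced score and ranked last, whereas hard non-maximum suppression with a typical IoU threshold would discard it. That behavior is what makes the soft variant attractive when objects sit close together.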

Cite

Zhuang, S., Wang, P., Jiang, B., Wang, G., & Wang, C. (2019). A single shot framework with multi-scale feature fusion for geospatial object detection. Remote Sensing, 11(5). https://doi.org/10.3390/rs11050594
