Small object bird detection in infrared drone videos using mask R-CNN deep learning

10Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A reliable method to estimate population sizes of wild turkeys (Meleagris gallopavo) using unmanned aerial vehicles and thermal video imaging data collected at several field sites in Texas is described. Automating the data processing of airborne survey videos provides a fast and reproducible way to count wild turkeys for wildlife management and conservation. A deep learning semantic segmentation pipeline is developed to detect and count roosting Rio-Grande wild turkeys (M.g. intermedia) which appear as small faint objects in drone-based thermal IR videos. The proposed approach to detect roosting turkeys that appear as small objects, relies on Mask R-CNN, a deep architecture semantic segmentation algorithm. This is followed by a post-processing data association and filtering (DAF) process for counting the number of roosting birds. DAF was used to eliminate false positives like rocks and other small bright objects, which often have noisy detections across temporally adjacent video frames, that can be filtered using appearance association and distance-based gating across time. Transfer learning was used to train the Mask R-CNN network by initializing using ImageNet weights. Drone-based thermal IR videos are extremely challenging due to the complexity of the natural environment including weather effects, occlusion of birds, terrain, trees, complex tree shapes, rocks, water and thermal inversion. The transect videos were collected at night at several times and altitudes to optimize data collection opportunities without disturbing the roosting turkeys. Preliminary performance evaluation using 280 video frames is promising.

Cite

CITATION STYLE

APA

Kassim, Y. M., Byrne, M. E., Burch, C., Mote, K., Hardin, J., Larsen, D. R., & Palaniappan, K. (2020). Small object bird detection in infrared drone videos using mask R-CNN deep learning. In IS and T International Symposium on Electronic Imaging Science and Technology (Vol. 2020). Society for Imaging Science and Technology. https://doi.org/10.2352/ISSN.2470-1173.2020.8.IMAWM-085

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free