Large-scale automatic species identification

Jeff Mo; Eibe Frank; Varvara Vetrova

Conference Proceedings

Large-scale automatic species identification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10400 LNAI 301-312

DOI: 10.1007/978-3-319-63004-5_24

N/ACitations

23Readers

Get full text

Abstract

The crowd-sourced Naturewatch GBIF dataset is used to obtain a species classification dataset containing approximately 1.2 million photos of nearly 20 thousand different species of biological organisms observed in their natural habitat. We present a general hierarchical species identification system based on deep convolutional neural networks trained on the NatureWatch dataset. The dataset contains images taken under a wide variety of conditions and is heavily imbalanced, with most species associated with only few images. We apply multi-view classification as a way to lend more influence to high frequency details, hierarchical fine-tuning to help with class imbalance and provide regularisation, and automatic specificity control for optimising classification depth. Our system achieves 55.8% accuracy when identifying individual species and around 90% accuracy at an average taxonomy depth of 5.1—equivalent to the taxonomic rank of “family”—when applying automatic specificity control.

Author supplied keywords

Cite

CITATION STYLE

APA

Mo, J., Frank, E., & Vetrova, V. (2017). Large-scale automatic species identification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10400 LNAI, pp. 301–312). Springer Verlag. https://doi.org/10.1007/978-3-319-63004-5_24

Large-scale automatic species identification

Abstract

Author supplied keywords

Cite

Register to see more suggestions