Channel and frequency attention module for diverse animal sound classification

Abstract

In-class species classification based on animal sounds is highly challenging even when the latest deep learning techniques are applied. Distinguishing species becomes harder still when a single class contains a large number of species. This paper presents a novel approach to the fine-grained categorization of animal species from their sounds, using pre-trained CNNs and a new self-attention module well suited to acoustic signals. The proposed method is shown to be effective: it achieves an average species accuracy of 98.37% and a minimum species accuracy of 94.38%, the highest among the competing baselines, which include CNNs without self-attention and CNNs with CBAM, FAM, and CFAM but without pre-training.

Cite

Ko, K., Park, J., Han, D. K., & Ko, H. (2019). Channel and frequency attention module for diverse animal sound classification. IEICE Transactions on Information and Systems, E102D(12), 2615–2618. https://doi.org/10.1587/transinf.2019EDL8128
