Whale vocalization classification using feature extraction with resonance sparse signal decomposition and ridge extraction

9Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Whales communicate using whistle vocalizations that are essentially underwater acoustic frequency-modulated tones. Inevitable environmental noise decreases recognition accuracy of these sounds during wide range detection. In this paper, we propose a robust time - frequency analysis method that combines resonance sparse signal decomposition (RSSD) and spectrogram ridge extraction. We apply RSSD to extract whistle components from the raw signal, and then we segment the ridge regions of the whistle spectrograms. By applying a partial derivative method, we extract the whistle spectrogram ridge representing an accurate trace of the whistle vocalization. From these results, we extract ridge features and use an SVM or a random forest to identify the whale species. We evaluated our method using experiments with samples for four whale species. Compared with direct ridge extraction directly without RSSD, our proposed method achieved better extraction of frequency characteristics of the vocalizations. Our proposed method achieved an accuracy rate of over 98% for sounds from four species when using five training samples.

Cite

CITATION STYLE

APA

Chen, H., Sun, H., Junejo, N. U. R., Yang, G., & Qi, J. (2019). Whale vocalization classification using feature extraction with resonance sparse signal decomposition and ridge extraction. IEEE Access, 7, 136358–136368. https://doi.org/10.1109/ACCESS.2019.2919321

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free