Pattern recognition and digital signal processing techniques allow the design of automated systems for avian monitoring. They are a non-intrusive and cost-effective way to perform surveys of bird populations and assessments of biological diversity. In this study, a number of representation approaches for bird sounds are compared; namely, feature and dissimilarity representations. In order to take into account the non-stationary nature of the audio signals and to build robust dissimilarity representations, the application of the Earth Mover's Distance (EMD) to time-varying measurements is proposed. Measures of the leave-one-out 1-NN performance are used as comparison criteria. Results show that, overall, the Mel-ceptrum coefficients are the best alternative; specially when computed by frames and used in combination with EMD to generate dissimilarity representations. © 2011 Springer-Verlag.
CITATION STYLE
Ruiz-Muñoz, J. F., Orozco-Alzate, M., & Castellanos-Domínguez, C. G. (2011). Feature and dissimilarity representations for the sound-based recognition of bird species. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7042 LNCS, pp. 451–458). https://doi.org/10.1007/978-3-642-25085-9_53
Mendeley helps you to discover research relevant for your work.