Environmental sound recognition using masked conditional neural networks

Fady Medhat; David Chesmore; John Robinson

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10604 LNAI 373-385

DOI: 10.1007/978-3-319-69179-4_26

Abstract

Neural network architectures used for sound recognition are usually adapted from other application domains and may not exploit sound-specific properties. The ConditionaL Neural Network (CLNN) is designed to account for the relationships across frames of a temporal signal. Its extension, the Masked ConditionaL Neural Network (MCLNN), embeds filterbank-like behavior within the network, which forces the network to learn in frequency bands rather than individual bins. Additionally, the MCLNN automates the exploration of different feature combinations, analogous to handcrafting the optimum combination of features for a recognition task. We applied the MCLNN to the environmental sounds of the ESC-10 dataset. The MCLNN achieved accuracies competitive with state-of-the-art convolutional neural networks and hand-crafted approaches.
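
The idea of learning in frequency bands rather than bins can be illustrated with a binary mask applied elementwise to a weight matrix. The following is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the mask layout, so the `bandwidth` and `stride` parameters and the function names `band_mask` and `masked_forward` are hypothetical, and the conditioning across neighbouring frames that the CLNN adds is left out.

```python
def band_mask(n_bins, n_hidden, bandwidth, stride):
    """Build a binary mask of shape (n_bins, n_hidden).

    Hidden unit j is connected only to a contiguous band of frequency
    bins. The layout (start = j * stride, wrapped around n_bins) is an
    assumption made for illustration.
    """
    mask = [[0] * n_hidden for _ in range(n_bins)]
    for j in range(n_hidden):
        start = (j * stride) % n_bins
        for i in range(start, min(start + bandwidth, n_bins)):
            mask[i][j] = 1
    return mask


def masked_forward(frame, weights, mask):
    """Linear pre-activation of one spectral frame through masked weights.

    Each weight is multiplied by its mask entry, so connections outside
    a hidden unit's band contribute nothing.
    """
    n_hidden = len(weights[0])
    return [
        sum(frame[i] * weights[i][j] * mask[i][j] for i in range(len(frame)))
        for j in range(n_hidden)
    ]


# Toy example: 6 frequency bins, 3 hidden units, non-overlapping bands of 2 bins.
mask = band_mask(n_bins=6, n_hidden=3, bandwidth=2, stride=2)
frame = [1, 2, 3, 4, 5, 6]
weights = [[1] * 3 for _ in range(6)]
outputs = masked_forward(frame, weights, mask)
```

With all weights set to 1, each hidden unit simply sums the bins in its own band, which mimics a crude filterbank. Choosing a stride smaller than the bandwidth would give overlapping bands; which layout the MCLNN actually uses is described in the full paper, not in this abstract.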

Citation (APA)

Medhat, F., Chesmore, D., & Robinson, J. (2017). Environmental sound recognition using masked conditional neural networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10604 LNAI, pp. 373–385). Springer Verlag. https://doi.org/10.1007/978-3-319-69179-4_26
