SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper presents a model to classify ambient sounds in an automatic and real-time way using the sound dataset provided in the Kaggle free sounds competition. For this, two data preprocessing techniques are performed, the first, length normalization that unifies the audio inputs to a single time interval and the second, property normalization that standardizes the sampling frequency and bit depth; This also includes a DNN (Deep Neural Network) capable of classifying common environmental sounds, the input for the network is formed by MFCC (Mel Frequency Cepstral Coefficients) vectors, which reduces the processing time improving the response capacity of the model for detect sounds, especially those that are considered warning signs about environmental threats, facilitating the mobility of people with hearing impairment.

Cite

CITATION STYLE

APA

Mora-Regalado, M. J., Ruiz-Vivanco, O., González-Eras, A., & Torres-Carrión, P. (2020). SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients. In Communications in Computer and Information Science (Vol. 1194 CCIS, pp. 245–253). Springer. https://doi.org/10.1007/978-3-030-42520-3_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free