SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

María José Mora-Regalado; Omar Ruiz-Vivanco; Alexandra González-Eras; Pablo Torres-Carrión

Conference Proceedings

SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

Communications in Computer and Information Science (2020) 1194 CCIS 245-253

DOI: 10.1007/978-3-030-42520-3_20

0Citations

4Readers

Get full text

Abstract

This paper presents a model to classify ambient sounds in an automatic and real-time way using the sound dataset provided in the Kaggle free sounds competition. For this, two data preprocessing techniques are performed, the first, length normalization that unifies the audio inputs to a single time interval and the second, property normalization that standardizes the sampling frequency and bit depth; This also includes a DNN (Deep Neural Network) capable of classifying common environmental sounds, the input for the network is formed by MFCC (Mel Frequency Cepstral Coefficients) vectors, which reduces the processing time improving the response capacity of the model for detect sounds, especially those that are considered warning signs about environmental threats, facilitating the mobility of people with hearing impairment.

Author supplied keywords

Cite

CITATION STYLE

APA

Mora-Regalado, M. J., Ruiz-Vivanco, O., González-Eras, A., & Torres-Carrión, P. (2020). SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients. In Communications in Computer and Information Science (Vol. 1194 CCIS, pp. 245–253). Springer. https://doi.org/10.1007/978-3-030-42520-3_20

SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

Abstract

Author supplied keywords

Cite

Register to see more suggestions