This paper introduces an acoustic event detection system that processes a continuous input audio stream to detect potentially dangerous acoustic events. The system is a lightweight, easily extensible, long-running, and complete solution for acoustic event detection. It uses its own approach to the detection and classification of acoustic events: a modified Viterbi decoding process combined with Weighted Finite-State Transducers (WFSTs) to support extensibility, with acoustic modeling based on Hidden Markov Models (HMMs). The system is written entirely in C++ and was designed to be self-sufficient, requiring no additional dependencies. It also includes a signal preprocessing stage for extracting Mel-Frequency Cepstral Coefficients (MFCC), Frequency Bank Coefficients (FBANK), and Mel-Spectral Coefficients (MELSPEC). To increase robustness, the system applies Cepstral Mean Normalization (CMN) and our proposed removal of basic coefficients from the feature vector. © Springer International Publishing Switzerland 2014.
CITATION STYLE
Lojka, M., Pleva, M., Kiktová, E., Juhár, J., & Čižmár, A. (2014). EAR-TUKE: The Acoustic Event Detection System. In Communications in Computer and Information Science (Vol. 429, pp. 137–148). Springer Verlag. https://doi.org/10.1007/978-3-319-07569-3_11