When applied to interactive seminars, the detection of acoustic events from audio information alone produces a large number of errors, mostly due to temporal overlaps of sounds. Video signals may be a useful additional source of information for coping with this problem for particular events. In this work, we aim to improve the detection of steps by using two audio-based Acoustic Event Detection (AED) systems, one based on SVM and one on HMM, together with a video-based AED system that employs the output of a 3D video tracking algorithm. The fuzzy integral is used to fuse the outputs of the three detection systems. Experimental results on the CLEAR 2007 evaluation data show that video information can be successfully used to improve the results of audio-based AED. © 2008 Springer-Verlag Berlin Heidelberg.
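To illustrate the fusion step, here is a minimal sketch of how a fuzzy integral can combine the scores of three detectors. The abstract does not state which form of the fuzzy integral or which fuzzy measure is used, so the sketch assumes the Choquet form; the function names, the detector labels (`svm`, `hmm`, `video`), the scores, and the measure values are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def additive_measure(weights):
    """Build an additive fuzzy measure g from per-source weights (assumed to sum to 1).

    Returns a dict mapping each non-empty subset (as a frozenset) to g(subset).
    A non-additive measure could be supplied by hand instead, to model
    interaction between sources.
    """
    names = list(weights)
    return {
        frozenset(subset): sum(weights[n] for n in subset)
        for size in range(1, len(names) + 1)
        for subset in combinations(names, size)
    }


def choquet_integral(scores, measure):
    """Fuse per-source scores in [0, 1] with the Choquet integral w.r.t. `measure`.

    scores:  dict source name -> score
    measure: dict frozenset of source names -> g value, with g(all sources) = 1
    """
    # Sort sources by decreasing score.
    ordered = sorted(scores, key=scores.get, reverse=True)
    fused = 0.0
    for i, name in enumerate(ordered):
        # Score of the next source in the ordering (0 after the last one).
        next_score = scores[ordered[i + 1]] if i + 1 < len(ordered) else 0.0
        # Coalition of the i+1 highest-scoring sources.
        coalition = frozenset(ordered[: i + 1])
        fused += (scores[name] - next_score) * measure[coalition]
    return fused


# Hypothetical detector scores for the class "steps" on one segment.
scores = {"svm": 0.2, "hmm": 0.6, "video": 0.9}
# Hypothetical importance weights; with an additive measure the Choquet
# integral reduces to a weighted mean of the scores.
measure = additive_measure({"svm": 0.3, "hmm": 0.3, "video": 0.4})
fused_score = choquet_integral(scores, measure)
```

With an additive measure the result equals the weighted arithmetic mean of the scores; the appeal of the fuzzy integral is that a non-additive measure can also assign importance to groups of detectors rather than only to each one individually.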
CITATION STYLE
Butko, T., Temko, A., Nadeu, C., & Canton, C. (2008). Inclusion of video information for detection of acoustic events using the fuzzy integral. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5237 LNCS, pp. 74–85). Springer Verlag. https://doi.org/10.1007/978-3-540-85853-9_7