For stereo audio surveillance in complex environment, we proposed a bottom-up audio attention model based on spatial audio cues analysis, and an environment adaptive normalization method. The traditional audio attention models are based on mono audio characters, such as energy, energy peak, or pitch. Their performance is limited by neglecting the spatial information. The spatial cues in audio stream provide additional information for attention analysis. And the dynamic updated background sound can help to reduce the environment effect. The preliminary experiment showed that the proposed model is an effective way to analyzing attention events, which is caused by rapid moving sound source, in stereo audio stream. © 2009 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Hang, B., Hu, R., Yang, Y., Ma, Y., & Chang, J. (2009). Surveillance audio attention model based on spatial audio cues. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5879 LNCS, pp. 908–916). https://doi.org/10.1007/978-3-642-10467-1_81
Mendeley helps you to discover research relevant for your work.