This chapter presents state-of-the-art research and open topics for analyzing complex sound scenes in a single microphone case. First, the concept of sound scene recognition is presented, from the perspective of different paradigms (classification, tagging, clustering, segmentation) and methods used. The core section is on sound event detection and classification, presenting various paradigms and practical considerations along with methods for monophonic and polyphonic sound event detection. The chapter will then focus on the concepts of context and "language modeling" for sound scenes, also covering the concept of relationships between sound events. Work on sound scene recognition based on event detection is also presented. Finally the chapter will summarize the topic and will provide directions for future research.
CITATION STYLE
Benetos, E., Stowell, D., & Plumbley, M. D. (2017). Approaches to complex sound scene analysis. In Computational Analysis of Sound Scenes and Events (pp. 215–242). Springer International Publishing. https://doi.org/10.1007/978-3-319-63450-0_8
Mendeley helps you to discover research relevant for your work.