In this work, we adopt an information theoretic approach - the Information Bottleneck method - to extract the relevant modulation frequencies across both dimensions of a spectrogram, for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate the effectiveness of a simple thresholding classifier which is based on the similarity of a sound to each characteristic modulation spectrum. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Markaki, M., Wohlmayer, M., & Stylianou, Y. (2007). Extraction of speech-relevant information from modulation spectrograms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4391 LNCS, pp. 78–88). Springer Verlag. https://doi.org/10.1007/978-3-540-71505-4_5
Mendeley helps you to discover research relevant for your work.