In this paper, we study the validity of the assumption that speech source signals exhibit lower dependency and therefore better separability with Independent Component Analysis algorithms than music sources. In particular, we investigate some dependency measures in the temporal and the time-frequency domains, resp. in the framework of instantaneous and convolutive mixtures. Moreover, we test several ICA methods, based on the above dependency measures, on the same source signals. We experimentally show that speech and music sources tend to have the same mean behaviour for excerpt durations above 20 ms, but music signals provide more spread dependency measures and SIR values. Lastly, we experimentally show that Gaussian nonstationary mutual information is better suited to audio signals than mutual information. © Springer-Verlag Berlin Heidelberg 2009.
CITATION STYLE
Puigt, M., Vincent, E., & Deville, Y. (2009). Validity of the independence assumption for the separation of instantaneous and convolutive mixtures of speech and music sources. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5441, pp. 613–620). https://doi.org/10.1007/978-3-642-00599-2_77
Mendeley helps you to discover research relevant for your work.