A latently constrained mixture model for audio source separation and localization

4Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a method for audio source separation and localization from binaural recordings. The method combines a new generative probabilistic model with time-frequency masking. We suggest that device-dependent relationships between point-source positions and interaural spectral cues may be learnt in order to constrain a mixture model. This allows to capture subtle separation and localization features embedded in the auditory data. We illustrate our method with data composed of two and three mixed speech signals in the presence of reverberations. Using standard evaluation metrics, we compare our method with a recent binaural-based source separation-localization algorithm. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Deleforge, A., & Horaud, R. (2012). A latently constrained mixture model for audio source separation and localization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7191 LNCS, pp. 372–379). https://doi.org/10.1007/978-3-642-28551-6_46

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free