Frequency-domain blind separation of convolutive speech mixtures with energy correlation-based permutation correction

Li Dan Wang; Qiu Hua Lin

Conference Proceedings

Lecture Notes in Electrical Engineering (2010) 67 LNEE 381-390

DOI: 10.1007/978-3-642-12990-2_43

Abstract

Blind separation of convolutive speech mixtures in the frequency domain has clear advantages in terms of convergence and computation, but suffers from permutation ambiguity. Motivated by the fact that speech signals have strong correlations across frequency, the paper presents an energy correlation method for resolving the permutation ambiguity after the instantaneous speech mixtures at each frequency bin have been separated. Extensive experiments with synthetic and recorded speech signals compare the energy correlation method with the amplitude correlation method; three different complex-valued independent component analysis (ICA) algorithms are compared as well. The results show that the proposed method achieves better performance than the amplitude correlation method, and that the complex ICA algorithm based on negentropy maximization yields the best separation. © 2010 Springer-Verlag Berlin Heidelberg.
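The abstract does not give the details of the energy correlation step, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that the per-bin ICA outputs are already available as a complex array `Y` of shape `(n_bins, n_src, n_frames)`, that each bin is aligned against the previously aligned neighbouring bin, that "energy" means the per-frame squared magnitude, and that the best ordering is found by exhaustive search over permutations. The function names (`energy_envelope`, `correlation_matrix`, `align_permutations`) are hypothetical.

```python
import itertools
import numpy as np

def energy_envelope(Y_bin):
    """Per-frame energy |Y|^2 of each separated component in one frequency bin."""
    return np.abs(Y_bin) ** 2

def correlation_matrix(A, B):
    """Pearson correlation between every row of A and every row of B."""
    A = A - A.mean(axis=1, keepdims=True)
    B = B - B.mean(axis=1, keepdims=True)
    num = A @ B.T
    den = np.linalg.norm(A, axis=1)[:, None] * np.linalg.norm(B, axis=1)[None, :]
    return num / np.maximum(den, 1e-12)  # guard against all-zero envelopes

def align_permutations(Y):
    """Reorder the components of each bin to match the previous aligned bin.

    Y: complex array of shape (n_bins, n_src, n_frames).
    Returns the aligned copy and the permutation chosen for each bin.
    Assumption: bin 0 defines the reference ordering.
    """
    n_bins, n_src, _ = Y.shape
    out = Y.copy()
    perms = [tuple(range(n_src))]
    for f in range(1, n_bins):
        ref = energy_envelope(out[f - 1])
        cur = energy_envelope(out[f])
        C = correlation_matrix(ref, cur)  # C[i, j]: reference i vs. current component j
        # Exhaustive search: practical only for a small number of sources.
        best = max(
            itertools.permutations(range(n_src)),
            key=lambda p: sum(C[i, p[i]] for i in range(n_src)),
        )
        out[f] = out[f][list(best)]  # output i takes current component best[i]
        perms.append(best)
    return out, perms
```

Under the same assumptions, the amplitude correlation baseline mentioned in the abstract would correspond to replacing `np.abs(Y_bin) ** 2` with `np.abs(Y_bin)` in `energy_envelope`. Aligning only against the adjacent bin can propagate a single wrong decision to all higher bins, which is one reason practical systems often use a more robust reference; whether the paper does so is not stated in the abstract.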

Citation (APA)

Wang, L. D., & Lin, Q. H. (2010). Frequency-domain blind separation of convolutive speech mixtures with energy correlation-based permutation correction. In Lecture Notes in Electrical Engineering (Vol. 67 LNEE, pp. 381–390). https://doi.org/10.1007/978-3-642-12990-2_43
