Bayesian independent component analysis as applied to one-channel speech enhancement

Abstract

Our work applies a unifying Bayesian Independent Component Analysis (BICA) framework to speech enhancement and robust Automatic Speech Recognition (ASR). The corrupted speech waveform is segmented into overlapping frames, and each noisy frame is assumed to be the sum of the underlying clean speech and noise. Each clean frame is in turn modeled as a linear combination of latent independent basis functions. Two techniques follow from this Bayesian formulation. In the first, the posterior probability of a clean speech frame conditioned on the noisy frame is formed and a maximum a posteriori (MAP) estimate is taken, which leads to Sparse Code Shrinkage (SCS), a fairly recent statistical technique originally introduced in applied mathematics and image denoising whose potential for speech enhancement has not yet been exploited. In the second, cast within the Variational Bayes framework, noisy speech generation is stated block-wise as a noisy blind source separation problem, from which we infer the independent basis functions that span the space of a speech frame together with their mixing matrix, and thereby reconstruct the corresponding clean frames directly.
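
As a rough illustration of the first technique, the following Python sketch frames a noisy signal, projects each frame onto a basis, shrinks the coefficients, and resynthesizes by overlap-add. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: the transform W is orthogonal and supplied by the caller (in practice it would be learned by ICA on clean speech), the noise is white Gaussian with known variance, and each coefficient has a Laplacian prior, for which the MAP estimate reduces to soft thresholding. All function names and the frame length and hop size are hypothetical.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Cut a 1-D signal into overlapping frames, shape (n_frames, frame_len)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def overlap_add(frames, hop, length):
    """Rebuild a signal by averaging the overlapping frames."""
    n_frames, frame_len = frames.shape
    out = np.zeros(length)
    weight = np.zeros(length)
    for i in range(n_frames):
        start = i * hop
        out[start:start + frame_len] += frames[i]
        weight[start:start + frame_len] += 1.0
    return out / np.maximum(weight, 1.0)

def laplacian_shrink(u, noise_var, d):
    """MAP estimate of a Laplacian coefficient (std d) in Gaussian noise.

    Assumption: Laplacian prior, so the estimator is a soft threshold at
    sqrt(2) * noise_var / d.
    """
    threshold = np.sqrt(2.0) * noise_var / d
    return np.sign(u) * np.maximum(0.0, np.abs(u) - threshold)

def scs_denoise(noisy, W, noise_var, frame_len=64, hop=32):
    """Sketch of shrinkage-based denoising with an orthogonal transform W."""
    frames = frame_signal(noisy, frame_len, hop)
    coeffs = frames @ W.T  # per-frame coefficients s = W x
    # Assumption: the clean coefficient variance is the noisy variance minus
    # the noise variance, which holds for orthogonal W and white noise.
    d = np.sqrt(np.maximum(coeffs.var(axis=0) - noise_var, 1e-12))
    shrunk = laplacian_shrink(coeffs, noise_var, d)
    clean_frames = shrunk @ W  # inverse transform, x = W^T s for orthogonal W
    return overlap_add(clean_frames, hop, len(noisy))
```

A call such as `scs_denoise(noisy, W, noise_var=0.01)` expects `W` to be a 64 by 64 orthogonal matrix when the default frame length is used. The second, Variational Bayes technique infers the basis and mixing matrix from the noisy data itself and is not covered by this sketch.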

Citation (APA)

Potamitis, I., Fakotakis, N., & Kokkinakis, G. (2001). Bayesian independent component analysis as applied to one-channel speech enhancement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2130, pp. 593–600). Springer Verlag. https://doi.org/10.1007/3-540-44668-0_83
