This paper exploits non-negative matrix factorization (NMF)-based method for speech enhancement within speaker identification framework. The proposed algorithm considers speech atoms in deterministic way as a sum of harmonically-related sinusoids in spectral domain. This approach allows us to estimate specific signal structure of vowel signal in the presence of noise in order to make an efficient noise reduction using only noise exemplars. The experiments of the present research in application to the speaker identification are conducted on the computational hearing in multisource environments (CHiME) dataset. The obtained results demonstrate the effectiveness of the preprocessing enhancement, and outperforming the general NMF-based speech enhancer. Further studies show the channel compensation effect of the proposed method leads to performance comparable to the common mismatch reduction methods such as feature warping.
CITATION STYLE
Lyubimov, N., Nastasenko, M., Kotov, M., & Doroshin, D. (2014). Exploiting non-negative matrix factorization with linear constraints in noise-robust speaker identification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 200–208). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_25
Mendeley helps you to discover research relevant for your work.