Robust voice activity detection based on noise eigenspace

6Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

In this study, we propose a voice activity detector (VAD) based on a noise eigenspace, which improve the robustness of VAD by utilizing the compression capability of the eigenspace. A noise eigenspace is constructed by using eigenvalue decomposition of the noise correlation matrix. When noisy speech is projected into the noise eigenspace, the noise energy is packed into a few dimensions with large eigenvalues, and those dimensions hopefully possess relatively less speech, because the speech energy distribution is usually different from noise energy distribution. The noise can be reduced by discarding those dimensions with large noise energy, while no significant loss occurs in speech. To track noise variation, the noise eigenspace is periodically updated, where the computation cost for eigenspace construction can be kept at an acceptable level. The proposed VAD was evaluated using the TIMIT database mixed with several noises. The experiment showed that the proposed VAD is more accurate than previous VADs in noisy environments. © 2007 The Acoustical Society of Japan.

Cite

CITATION STYLE

APA

Ying, D., Shi, Y., Lu, X., Dang, J., & Soong, F. (2007). Robust voice activity detection based on noise eigenspace. Acoustical Science and Technology, 28(6), 413–423. https://doi.org/10.1250/ast.28.413

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free