A weighted feature voting approach for robust and real-time voice activity detection

N/ACitations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

This paper concerns a robust real-time voice activity detection (VAD) approach which is easy to understand and implement. The proposed approach employs several shortterm speech/nonspeech discriminating features in a voting paradigm to achieve a reliable performance in different environments. This paper mainly focuses on the performance improvement of a recently proposed approach which uses spectral peak valley difference (SPVD) as a feature for silence detection. The main issue of this paper is to apply a set of features with SPVD to improve the VAD robustness. The proposed approach uses a weighted voting scheme in order to take the discriminative power of the employed feature set into account. The experiments show that the proposed approach is more robust than the baseline approach from different points of view, including channel distortion and threshold selection. The proposed approach is also compared with some other VA D techniques for better confirmation of its achievements. Using the proposed weighted voting approach, the average VA D performance is increased to 89.29% for 5 different noise types and 8 SNR levels. The resulting performance is 13.79% higher than the approach based only on SPVD and even 2.25% higher than the not-weighted voting scheme. © 2011 ETRI.

Cite

CITATION STYLE

APA

Moattar, M. H., & Homayounpour, M. M. (2011). A weighted feature voting approach for robust and real-time voice activity detection. ETRI Journal, 33(1), 99–109. https://doi.org/10.4218/etrij.11.1510.0158

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free