Comparative evaluation of feature normalization techniques for speaker verification

Md Jahangir Alam; Pierre Ouellet; Patrick Kenny; Douglas O'Shaughnessy

Conference Proceedings

Comparative evaluation of feature normalization techniques for speaker verification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 7015 LNAI 246-253

DOI: 10.1007/978-3-642-25020-0_32

42Citations

49Readers

Get full text

Abstract

This paper investigates several feature normalization techniques for use in an i-vector speaker verification system based on a mixture probabilistic linear discriminant analysis (PLDA) model. The objective of the feature normalization technique is to compensate for the effects of environmental mismatch. Here, we study short-time Gaussianization (STG), short-time mean and variance normalization (STMVN), and short-time mean and scale normalization (STMSN) techniques. Our goal is to compare the performance of the above mentioned feature normalization techniques on the telephone (det5) and microphone speech (det1, det2, det3 and det4) of the NIST SRE 2010 corpora. Experimental results show that the performances of the STMVN and STMSN techniques are comparable to that of the STG technique. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Alam, M. J., Ouellet, P., Kenny, P., & O’Shaughnessy, D. (2011). Comparative evaluation of feature normalization techniques for speaker verification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7015 LNAI, pp. 246–253). https://doi.org/10.1007/978-3-642-25020-0_32

Comparative evaluation of feature normalization techniques for speaker verification

Abstract

Author supplied keywords

Cite

Register to see more suggestions