A comparative study of feature and score normalization for speaker verification

Rong Zheng; Shuwu Zhang; Bo Xu

Conference ProceedingsOPEN ACCESS

A comparative study of feature and score normalization for speaker verification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3832 LNCS 531-538

DOI: 10.1007/11608288_71

15Citations

13Readers

Abstract

In speaker verification, it is necessary to reduce the influence of different environmental conditions. In this paper, two stages of normalization techniques, feature normalization and score normalization, are examined for decreasing the mismatch between training and testing acoustic conditions. At the first stage, cepstral mean and variance normalization (CMVN) is modified to normalize the cepstral coefficients with the similar segmental parameter statistics. Next, due to score variability between verification trials, Test-dependent zero-score normalization (TZnorm) and Zero-dependent test-score normalization (ZTnorm) are comparatively presented to transform the output scores entirely and make the speaker-independent decision threshold more robust under adverse conditions. Experiments on NIST2002 SRE corpus show that the normalizations with CMVN in feature stage and ZTnorm in score stage achieved 20.3% relative reduction of EER and 18.1% relative reduction of the minimal DCF compared to the baseline system using CMN and zero normalization. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Zheng, R., Zhang, S., & Xu, B. (2006). A comparative study of feature and score normalization for speaker verification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3832 LNCS, pp. 531–538). https://doi.org/10.1007/11608288_71

A comparative study of feature and score normalization for speaker verification

Abstract

Cite

Register to see more suggestions