Speaker verification with adaptive spectral subband centroids

Tomi Kinnunen; Bingjun Zhang; Jia Zhu; Ye Wang

Conference Proceedings · Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4642 LNCS 58-66

DOI: 10.1007/978-3-540-74549-5_7

Abstract

Spectral subband centroids (SSC) have been used as an additional feature to cepstral coefficients in speech and speaker recognition. SSCs are computed as the centroid frequencies of subbands and they capture the dominant frequencies of the short-term spectrum. In the baseline SSC method, the subband filters are pre-specified. To allow better adaptation to formant movements and other dynamic phenomena, we propose to adapt the subband filter boundaries on a frame-by-frame basis using a globally optimal scalar quantization scheme. The method has only one control parameter, the number of subbands. Speaker verification results on the NIST 2001 task indicate that the selection of the parameter is not critical and that the method does not require additional feature normalization. © Springer-Verlag Berlin Heidelberg 2007.
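The two ideas in the abstract, centroid frequencies of subbands and frame-wise adaptation of the subband boundaries, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names `subband_centroids` and `adaptive_edges` are hypothetical, and the distortion criterion (power-weighted squared error around each subband centroid, minimized exactly by dynamic programming) is an assumption, since the abstract only says "globally optimal scalar quantization".

```python
import numpy as np

def subband_centroids(power, freqs, edges):
    """Centroid frequency of each subband; edges are bin indices, band m = [edges[m], edges[m+1])."""
    centroids = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        w = power[lo:hi]
        total = w.sum()
        # Assumption: a band with no energy falls back to its mean bin frequency.
        c = (freqs[lo:hi] * w).sum() / total if total > 0 else freqs[lo:hi].mean()
        centroids.append(c)
    return np.array(centroids)

def adaptive_edges(power, freqs, n_bands):
    """Split the bins into n_bands contiguous, non-empty bands that minimize the
    power-weighted squared error around each band's centroid (assumed criterion)."""
    n = len(power)
    # Prefix sums let the cost of any band be evaluated in constant time.
    w0 = np.concatenate(([0.0], np.cumsum(power)))
    w1 = np.concatenate(([0.0], np.cumsum(power * freqs)))
    w2 = np.concatenate(([0.0], np.cumsum(power * freqs ** 2)))

    def cost(i, j):
        # Weighted squared error of bins i..j-1 around their centroid.
        s0 = w0[j] - w0[i]
        if s0 <= 0:
            return 0.0
        s1 = w1[j] - w1[i]
        return max((w2[j] - w2[i]) - s1 * s1 / s0, 0.0)

    # D[m, j]: best total cost of covering bins 0..j-1 with m bands.
    D = np.full((n_bands + 1, n + 1), np.inf)
    back = np.zeros((n_bands + 1, n + 1), dtype=int)
    D[0, 0] = 0.0
    for m in range(1, n_bands + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                c = D[m - 1, i] + cost(i, j)
                if c < D[m, j]:
                    D[m, j] = c
                    back[m, j] = i

    # Trace the optimal boundaries back from the last bin.
    edges = [n]
    for m in range(n_bands, 0, -1):
        edges.append(int(back[m, edges[-1]]))
    return edges[::-1]

# Toy frame: 8 bins spaced 100 Hz apart, energy near 100-200 Hz and at 600 Hz.
freqs = np.arange(8) * 100.0
power = np.array([0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0])

fixed = subband_centroids(power, freqs, [0, 4, 8])      # pre-specified subbands
edges = adaptive_edges(power, freqs, n_bands=2)         # per-frame boundaries
adapted = subband_centroids(power, freqs, edges)
```

In this sketch the number of subbands is the only free parameter, matching the abstract's description. In a real front end, `adaptive_edges` would be called once per frame on that frame's power spectrum.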

Citation (APA)

Kinnunen, T., Zhang, B., Zhu, J., & Wang, Y. (2007). Speaker verification with adaptive spectral subband centroids. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4642 LNCS, pp. 58–66). Springer Verlag. https://doi.org/10.1007/978-3-540-74549-5_7
