Abstract
Spectral subband centroids (SSC) have been used as an additional feature to cepstral coefficients in speech and speaker recognition. SSCs are computed as the centroid frequencies of subbands and they capture the dominant frequencies of the short-term spectrum. In the baseline SSC method, the subband filters are pre-specified. To allow better adaptation to formant movements and other dynamic phenomena, we propose to adapt the subband filter boundaries on a frame-by-frame basis using a globally optimal scalar quantization scheme. The method has only one control parameter, the number of subbands. Speaker verification results on the NIST 2001 task indicate that the selection of the parameter is not critical and that the method does not require additional feature normalization. © Springer-Verlag Berlin Heidelberg 2007.
Cite
CITATION STYLE
Kinnunen, T., Zhang, B., Zhu, J., & Wang, Y. (2007). Speaker verification with adaptive spectral subband centroids. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4642 LNCS, pp. 58–66). Springer Verlag. https://doi.org/10.1007/978-3-540-74549-5_7
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.