Fusion of acoustic and tokenization features for speaker recognition

Rong Tong; Bin Ma; Kong Aik Lee; Changhuai You; Donglai Zhu; Tomi Kinnunen; Hanwu Sun; Minghui Dong; Eng Siong Chng; Haizhou Li

Conference Proceedings

Fusion of acoustic and tokenization features for speaker recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4274 LNAI 566-577

DOI: 10.1007/11939993_59

5Citations

14Readers

Get full text

Abstract

This paper describes our recent efforts in exploring effective discriminative features for speaker recognition. Recent researches have indicated that the appropriate fusion of features is critical to improve the performance of speaker recognition system. In this paper we describe our approaches for the NIST 2006 Speaker Recognition Evaluation. Our system integrated the cepstral GMM modeling, cepstral SVM modeling and tokenization at both phone level and frame level. The experimental results on both NIST 2005 SRE corpus and NIST 2006 SRE corpus are presented. The fused system achieved 8.14% equal error rate on 1conv4w-1conv4w test condition of the NIST 2006 SRE. © 2006 Springer-Verlag Berlin/Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Tong, R., Ma, B., Lee, K. A., You, C., Zhu, D., Kinnunen, T., … Li, H. (2006). Fusion of acoustic and tokenization features for speaker recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4274 LNAI, pp. 566–577). https://doi.org/10.1007/11939993_59

Fusion of acoustic and tokenization features for speaker recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions