Acoustic feature transformation using UBM-based LDA for speaker recognition

Chengzhu Yu; Gang Liu; John H.L. Hansen

Conference Proceedings

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (2014) 1851-1854

DOI: 10.21437/interspeech.2014-420

10 Citations

27 Readers

Abstract

In state-of-the-art speaker recognition systems, the universal background model (UBM) serves to partition the acoustic space: each Gaussian mixture of the trained UBM represents one distinct acoustic region. The posterior probabilities of features belonging to each region are then used as core components of the Baum-Welch statistics. The quality of the estimated Baum-Welch statistics therefore depends heavily on how separable the acoustic regions are from one another. In this paper, we propose to transform the front-end acoustic features into a space where the separability of the mixtures of the trained UBM can be optimized. To achieve this, a UBM is first trained on the acoustic features, and a transformation matrix is then estimated using linear discriminant analysis (LDA), treating each mixture of the trained UBM as an independent class. The proposed method, named UBM-based LDA (uLDA), therefore does not require speaker labels or any other supervised information. The resulting transformation matrix is applied to the acoustic features for i-Vector extraction. Experimental results on the male part of the core conditions of the NIST SRE 2010 dataset confirm improved performance with the proposed method.
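The pipeline described in the abstract (train a UBM, treat each mixture as a class, estimate an LDA transform, apply it to the features) might be sketched roughly as follows. This is a minimal illustration under several assumptions that the abstract does not state: the UBM is a small diagonal-covariance GMM trained with a plain NumPy EM loop, each frame is assigned to a single mixture by its highest posterior (hard assignment), the output dimensionality is chosen arbitrarily, and the data are synthetic. The function names `train_diag_ubm`, `posteriors`, and `estimate_ulda` are hypothetical and are not taken from the paper.

```python
import numpy as np

def posteriors(X, weights, means, variances):
    """Per-frame posterior of each diagonal-covariance mixture."""
    diff = X[:, None, :] - means[None, :, :]
    log_prob = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                       + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_joint = log_prob + np.log(weights)
    log_joint -= log_joint.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_joint)
    return post / post.sum(axis=1, keepdims=True)

def train_diag_ubm(X, n_mix, n_iter=20, seed=0):
    """Toy EM training of a diagonal GMM standing in for the UBM."""
    rng = np.random.default_rng(seed)
    n, _ = X.shape
    means = X[rng.choice(n, n_mix, replace=False)].copy()
    variances = np.tile(X.var(axis=0), (n_mix, 1))
    weights = np.full(n_mix, 1.0 / n_mix)
    for _ in range(n_iter):
        post = posteriors(X, weights, means, variances)
        counts = post.sum(axis=0) + 1e-10
        weights = counts / counts.sum()
        means = (post.T @ X) / counts[:, None]
        second_moment = (post.T @ X ** 2) / counts[:, None]
        variances = np.maximum(second_moment - means ** 2, 1e-6)
    return weights, means, variances

def estimate_ulda(X, labels, out_dim):
    """LDA with UBM mixture indices as class labels (no speaker labels)."""
    d = X.shape[1]
    global_mean = X.mean(axis=0)
    Sw = np.zeros((d, d))  # within-class scatter
    Sb = np.zeros((d, d))  # between-class scatter
    for k in np.unique(labels):
        Xk = X[labels == k]
        mk = Xk.mean(axis=0)
        centered = Xk - mk
        Sw += centered.T @ centered
        delta = (mk - global_mean)[:, None]
        Sb += len(Xk) * (delta @ delta.T)
    Sw += 1e-6 * np.eye(d)  # small ridge so Sw stays positive definite
    # Solve Sb v = lambda Sw v by whitening with the Cholesky factor of Sw.
    L_inv = np.linalg.inv(np.linalg.cholesky(Sw))
    eigvals, eigvecs = np.linalg.eigh(L_inv @ Sb @ L_inv.T)
    order = np.argsort(eigvals)[::-1][:out_dim]
    return L_inv.T @ eigvecs[:, order]  # shape (d, out_dim)

# Synthetic "acoustic features": 600 frames, 6 dimensions, 3 clusters.
rng = np.random.default_rng(0)
centers = rng.normal(scale=4.0, size=(3, 6))
X = np.vstack([c + rng.normal(size=(200, 6)) for c in centers])

weights, means, variances = train_diag_ubm(X, n_mix=4)
post = posteriors(X, weights, means, variances)
labels = post.argmax(axis=1)           # hard assignment (an assumption)
W = estimate_ulda(X, labels, out_dim=3)
Y = X @ W                              # transformed features
```

In a real system the transformed features `Y` would be passed on to i-Vector extraction; the toy sizes here (4 mixtures, 6 dimensions) are far smaller than anything used in practice and are chosen only to keep the sketch quick to run.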

Cite

Yu, C., Liu, G., & Hansen, J. H. L. (2014). Acoustic feature transformation using UBM-based LDA for speaker recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp. 1851–1854). International Speech Communication Association. https://doi.org/10.21437/interspeech.2014-420
