Speaker recognition with mixtures of gaussians with sparse regression matrices

Constantinos Boulis

Conference ProceedingsOPEN ACCESS

Speaker recognition with mixtures of gaussians with sparse regression matrices

Boulis C

HLT-NAACL 2004 - Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (2004) 55-60

DOI: 10.3115/1614038.1614048

0Citations

72Readers

Abstract

When estimating a mixture of Gaussians there are usually two choices for the covariance type of each Gaussian component. Either diagonal or full covariance. Imposing a structure though may be restrictive and lead to degraded performance and/or increased computations. In this work, several criteria to estimate the structure of regression matrices of a mixture of Gaussians are introduced and evaluated. Most of the criteria attempt to estimate a discriminative structure, which is suited for classification tasks. Results are reported on the 1996 NIST speaker recognition task and performance is compared with structural EM, a well-known, non-discriminative, structure-finding algorithm.

Cite

CITATION STYLE

APA

Boulis, C. (2004). Speaker recognition with mixtures of gaussians with sparse regression matrices. In HLT-NAACL 2004 - Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (pp. 55–60). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614038.1614048

Speaker recognition with mixtures of gaussians with sparse regression matrices

Abstract

Cite

Register to see more suggestions