Speaker recognition with mixtures of gaussians with sparse regression matrices

0Citations
Citations of this article
69Readers
Mendeley users who have this article in their library.

Abstract

When estimating a mixture of Gaussians there are usually two choices for the covariance type of each Gaussian component. Either diagonal or full covariance. Imposing a structure though may be restrictive and lead to degraded performance and/or increased computations. In this work, several criteria to estimate the structure of regression matrices of a mixture of Gaussians are introduced and evaluated. Most of the criteria attempt to estimate a discriminative structure, which is suited for classification tasks. Results are reported on the 1996 NIST speaker recognition task and performance is compared with structural EM, a well-known, non-discriminative, structure-finding algorithm.

Cite

CITATION STYLE

APA

Boulis, C. (2004). Speaker recognition with mixtures of gaussians with sparse regression matrices. In HLT-NAACL 2004 - Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (pp. 55–60). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614038.1614048

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free