Ensemble based speaker recognition using unsupervised data selection

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

This paper presents an ensemble-based speaker recognition using unsupervised data selection. Ensemble learning is a type of machine learning that applies a combination of several weak learners to achieve an improved performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained with these non-overlapping subsets of speech data to improve the recognition accuracy. This new approach has two advantages. First, without any auxiliary information, we use ensemble classifiers based on unsupervised data selection to make use of different acoustic characteristics of speech data. Second, in ensemble classifiers, we apply the divide-and-conquer strategy to avoid a local optimization in the training of a single classifier. Our experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.

Cite

CITATION STYLE

APA

Huang, C. L., Wang, J. C., & Ma, B. (2016, May 10). Ensemble based speaker recognition using unsupervised data selection. APSIPA Transactions on Signal and Information Processing. Cambridge University Press. https://doi.org/10.1017/ATSIP.2016.10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free