Online speaker clustering using incremental learning of an ergodic hidden Markov model

Takafumi Koshinaka; Kentaro Nagatomo; Koichi Shinoda

Journal ArticleOPEN ACCESS

Online speaker clustering using incremental learning of an ergodic hidden Markov model

IEICE Transactions on Information and Systems (2012) E95-D(10) 2469-2478

DOI: 10.1587/transinf.E95.D.2469

4Citations

5Readers

Abstract

A Novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation. Copyright © 2012 The Institute of Electronics, Information and Communication Engineers.

Author supplied keywords

Cite

CITATION STYLE

APA

Koshinaka, T., Nagatomo, K., & Shinoda, K. (2012). Online speaker clustering using incremental learning of an ergodic hidden Markov model. IEICE Transactions on Information and Systems, E95-D(10), 2469–2478. https://doi.org/10.1587/transinf.E95.D.2469

Online speaker clustering using incremental learning of an ergodic hidden Markov model

Abstract

Author supplied keywords

Cite

Register to see more suggestions