A sampling-based speaker clustering using utterance-oriented Dirichlet process mixture model and its evaluation on large-scale data

2Citations
Citations of this article
92Readers
Mendeley users who have this article in their library.

Abstract

An infinite mixture model is applied to model-based speaker clustering with sampling-based optimization to make it possible to estimate the number of speakers. For this purpose, a framework of non-parametric Bayesian modeling is implemented with the Markov chain Monte Carlo and incorporated in the utterance-oriented speaker model. The proposed model is called the utterance-oriented Dirichlet process mixture model (UO-DPMM). The present paper demonstrates that UO-DPMM is successfully applied on large-scale data and outperforms the conventional hierarchical agglomerative clustering, especially for large amounts of utterances.

Cite

CITATION STYLE

APA

Tawara, N., Ogawa, T., Watanabe, S., Nakamura, A., & Kobayashi, T. (2015). A sampling-based speaker clustering using utterance-oriented Dirichlet process mixture model and its evaluation on large-scale data. APSIPA Transactions on Signal and Information Processing, 4. https://doi.org/10.1017/ATSIP.2015.19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free