Mixtures of gamma priors for non-negative matrix factorization based speech separation

Tuomas Virtanen; Ali Taylan Cemgil

Conference Proceedings

Mixtures of gamma priors for non-negative matrix factorization based speech separation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5441 646-653

DOI: 10.1007/978-3-642-00599-2_81

19Citations

33Readers

Get full text

Abstract

This paper deals with audio source separation using supervised non-negative matrix factorization (NMF). We propose a prior model based on mixtures of Gamma distributions for each sound class, which hyperparameters are trained given a training corpus. This formulation allows adapting the spectral basis vectors of the sound sources during actual operation, when the exact characteristics of the sources are not known in advance. Simulations were conducted using a random mixture of two speakers. Even without adaptation the mixture model outperformed the basic NMF, and adaptation furher improved slightly the separation quality. Audio demonstrations are available at www.cs.tut.fi/~tuomasv. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Virtanen, T., & Cemgil, A. T. (2009). Mixtures of gamma priors for non-negative matrix factorization based speech separation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5441, pp. 646–653). https://doi.org/10.1007/978-3-642-00599-2_81

Mixtures of gamma priors for non-negative matrix factorization based speech separation

Abstract

Cite

Register to see more suggestions