Single-channel audio source separation with NMF: Divergences, constraints and algorithms

44Citations
Citations of this article
36Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Spectral decomposition by nonnegative matrix factorisation (NMF) has become state-of-the-art practice in many audio signal processing tasks, such as source separation, enhancement or transcription. This chapter reviews the fundamentals of NMF-based audio decomposition, in unsupervised and informed settings. We formulate NMF as an optimisation problem and discuss the choice of the measure of fit. We present the standard majorisation-minimisation strategy to address optimisation for NMF with the common β -divergence, a family of measures of fit that takes the quadratic cost, the generalised Kullback-Leibler divergence and the Itakura-Saito divergence as special cases. We discuss the reconstruction of time-domain components from the spectral factorisation and present common variants of NMF-based spectral decomposition: supervised and informed settings, regularised versions, temporal models.

Cite

CITATION STYLE

APA

Févotte, C., Vincent, E., & Ozerov, A. (2018). Single-channel audio source separation with NMF: Divergences, constraints and algorithms. In Signals and Communication Technology (pp. 1–24). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-319-73031-8_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free