Weakly supervised audio source separation via spectrum energy preserved Wasserstein learning

8Citations
Citations of this article
40Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Separating audio mixtures into individual instrument tracks has been a standing challenge. We introduce a novel weakly supervised audio source separation approach based on deep adversarial learning. Specifically, our loss function adopts the Wasserstein distance which directly measures the distribution distance between the separated sources and the real sources for each individual source. Moreover, a global regularization term is added to fulfill the spectrum energy preservation property regardless separation. Unlike state-of-the-art weakly supervised models which often involve deliberately devised constraints or careful model selection, our approach need little prior model specification on the data, and can be straightforwardly learned in an end-to-end fashion. We show that the proposed method performs competitively on public benchmark against state-of-the-art weakly supervised methods.

Cite

CITATION STYLE

APA

Zhang, N., Yan, J., & Zhou, Y. (2018). Weakly supervised audio source separation via spectrum energy preserved Wasserstein learning. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 4574–4580). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/636

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free