Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks

46Citations
Citations of this article
47Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Most single channel audio source separation approaches produce separated sources accompanied by interference from other sources and other distortions. To tackle this problem, we propose to separate the sources in two stages. In the first stage, the sources are separated from the mixed signal. In the second stage, the interference between the separated sources and the distortions are reduced using deep neural networks (DNNs). We propose two methods that use DNNs to improve the quality of the separated sources in the second stage. In the first method, each separated source is improved individually using its own trained DNN, while in the second method all the separated sources are improved together using a single DNN. To further improve the quality of the separated sources, the DNNs in the second stage are trained discriminatively to further decrease the interference and the distortions of the separated sources. Our experimental results show that using two stages of separation improves the quality of the separated signals by decreasing the interference between the separated sources and distortions compared to separating the sources using a single stage of separation.

Cite

CITATION STYLE

APA

Grais, E. M., Roma, G., Simpson, A. J. R., & Plumbley, M. D. (2017). Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks. IEEE/ACM Transactions on Audio Speech and Language Processing, 25(9), 1773–1783. https://doi.org/10.1109/TASLP.2017.2716443

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free