FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method

Li Li; Hirokazu Kameoka; Shota Inoue; Shoji Makino

Journal ArticleOPEN ACCESS

FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method

IEEE Access (2020) 8 228740-228753

DOI: 10.1109/ACCESS.2020.3045704

3Citations

9Readers

Abstract

This paper proposes a fast optimization algorithm for the multichannel variational autoencoder (MVAE) method, a recently proposed powerful multichannel source separation technique. The MVAE method can achieve good source separation performance thanks to a convergence-guaranteed optimization algorithm and the idea of jointly performing multi-speaker separation and speaker identification. However, one drawback is the high computational cost of the optimization algorithm. To overcome this drawback, this paper proposes using an auxiliary classifier VAE, an information-theoretic extension of the conditional VAE (CVAE), to train the generative model of the source spectrograms and using it to efficiently update the parameters of the source spectrogram models at each iteration of the source separation algorithm. We call the proposed algorithm 'FastMVAE' (or fMVAE for short). Experimental evaluations revealed that the proposed fast algorithm can achieve high source separation performance in both speaker-dependent and speaker-independent scenarios while significantly reducing the computational time compared to the original MVAE method by more than 90% on both GPU and CPU. However, there is still room for improvement of about 3 dB compared to the original MVAE method.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, L., Kameoka, H., Inoue, S., & Makino, S. (2020). FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method. IEEE Access, 8, 228740–228753. https://doi.org/10.1109/ACCESS.2020.3045704

FastMVAE: A Fast Optimization Algorithm for the Multichannel Variational Autoencoder Method

Abstract

Author supplied keywords

Cite

Register to see more suggestions