Multi-scale audio super resolution via deep pyramid wavelet convolutional neural network

Binqiang Si; Dongqi Luo; Jihong Zhu

Journal ArticleOPEN ACCESS

Multi-scale audio super resolution via deep pyramid wavelet convolutional neural network

Electronics Letters (2021) 57(13) 520-522

DOI: 10.1049/ell2.12180

2Citations

6Readers

Abstract

In this letter, a pyramid wavelet convolutional neural network for audio super resolution is presented. Since the audio signal is non-stationary, previous convolutional neural network based approaches may fail in capturing the details, these method usually focus on the global approximation error and thus produce over smooth results. To cope with this issue, it is suggested to predict the wavelet coefficients of the audio signal, and reconstruct the signal from these coefficients stage by stage rather. The prediction errors of the wavelet coefficients are included to the loss function to force the model to capture the detail components. Experimental results show that the approach, training on the VCTK public dataset, achieves more appealing results than state-of-the-art methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Si, B., Luo, D., & Zhu, J. (2021). Multi-scale audio super resolution via deep pyramid wavelet convolutional neural network. Electronics Letters, 57(13), 520–522. https://doi.org/10.1049/ell2.12180

Multi-scale audio super resolution via deep pyramid wavelet convolutional neural network

Abstract

Author supplied keywords

Cite

Register to see more suggestions