Progressive loss functions for speech enhancement with deep neural networks

Jorge Llombart; Dayana Ribas; Antonio Miguel; Luis Vicente; Alfonso Ortega; Eduardo Lleida

Journal Article, Open Access

Eurasip Journal on Audio, Speech, and Music Processing (2021) 2021(1)

DOI: 10.1186/s13636-020-00191-3

Abstract

The progressive paradigm is a promising strategy for optimizing network performance in speech enhancement. Recent works have proposed different strategies based on this mechanism to improve the accuracy of speech enhancement solutions. This paper studies progressive speech enhancement using convolutional and residual neural network architectures and explores two criteria for loss function optimization: weighted progressive and uniform progressive. The evaluation is carried out on simulated and real speech samples with reverberation and added noise, using the REVERB and VoiceHome datasets. Experimental results differ across the loss function optimization criteria and the network architectures. Overall, they show that the progressive design strengthens the model and increases its robustness to distortions due to reverberation and noise.
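
The abstract names the two loss criteria without defining them. As a rough illustration only, the sketch below assumes that a progressive loss sums a per-stage mean squared error against the clean target over the intermediate outputs of the network, with either equal weights ("uniform") or weights that grow toward the final stage ("weighted"). The function names, the linear weighting scheme, and the use of MSE are assumptions made for this example and may not match the formulation in the paper.

```python
from typing import List, Optional, Sequence


def mse(estimate: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between one stage output and the clean target."""
    if len(estimate) != len(target):
        raise ValueError("estimate and target must have the same length")
    return sum((e - t) ** 2 for e, t in zip(estimate, target)) / len(target)


def linear_weights(num_stages: int) -> List[float]:
    """Hypothetical weighting: grows linearly with depth and sums to 1."""
    total = num_stages * (num_stages + 1) / 2
    return [(i + 1) / total for i in range(num_stages)]


def progressive_loss(
    stage_outputs: Sequence[Sequence[float]],
    target: Sequence[float],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Weighted sum of per-stage MSE values.

    With weights=None every stage contributes equally (the assumed
    "uniform" criterion); passing explicit weights gives the assumed
    "weighted" criterion.
    """
    num_stages = len(stage_outputs)
    if weights is None:
        weights = [1.0 / num_stages] * num_stages
    if len(weights) != num_stages:
        raise ValueError("need exactly one weight per stage")
    return sum(w * mse(out, target) for w, out in zip(weights, stage_outputs))


# Toy example: two stages, the second one matches the target exactly.
target = [1.0, 1.0]
stage_outputs = [[0.0, 0.0], [1.0, 1.0]]
uniform = progressive_loss(stage_outputs, target)
weighted = progressive_loss(stage_outputs, target, linear_weights(2))
```

In this toy case the weighted variant penalizes the error of the early stage less than the uniform variant does, which is the kind of trade-off the two criteria are meant to expose.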

Cite

Llombart, J., Ribas, D., Miguel, A., Vicente, L., Ortega, A., & Lleida, E. (2021). Progressive loss functions for speech enhancement with deep neural networks. Eurasip Journal on Audio, Speech, and Music Processing, 2021(1). https://doi.org/10.1186/s13636-020-00191-3
