MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement

Jianrong Wang; Xiaomin Li; Xuewei Li; Mei Yu; Qiang Fang; Li Liu

Conference Proceedings

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2023) 13624 LNCS 101-112

DOI: 10.1007/978-3-031-30108-7_9

0Citations

1Readers

Get full text

Abstract

Speech enhancement improves speech quality and promotes the performance of various downstream tasks. However, most current speech enhancement work was mainly devoted to improving the performance of downstream automatic speech recognition (ASR), only a relatively small amount of work focused on the automatic speaker verification (ASV) task. In this work, we propose a MVNet consisted of a memory assistance module which improves the performance of downstream ASR and a vocal reinforcement module to boosts the performance of ASV. In addition, we design a new loss function to improve speaker vocal similarity. Experimental results on the Libri2mix dataset show that our method outperforms baseline methods in several metrics, including speech quality, intelligibility, and speaker vocal similarity.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, J., Li, X., Li, X., Yu, M., Fang, Q., & Liu, L. (2023). MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13624 LNCS, pp. 101–112). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-30108-7_9

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement

Abstract

Author supplied keywords

Cite

Register to see more suggestions