This paper presents a unified speech enhancement system to remove both background noise and interfering speech in serious noise environments by jointly utilizing the parabolic reflector model and neural beamformer. First, the amplification property of paraboloid is discussed, which significantly improves the Signal-to-Noise Ratio (SNR) of a desired signal. Therefore, an appropriate paraboloid channel is analyzed and designed through the boundary element method. On the other hand, a time-frequency masking approach and a mask-based beamforming approach are discussed and incorporated in an enhancement system. It is worth noticing that signals provided by the paraboloid and the beamformer are exactly complementary. Finally, these signals are employed in a learning-based fusion framework to further improve the system performance in low SNR environments. Experiments demonstrate that our system is effective and robust in five different noisy conditions (speech interfered with factory, pink, destroyer engine, volvo, and babble noise), as well as in different noise levels. Compared with the original noisy speech, significant average objective metrics improvements are about ΔSTOI = 0.28, ΔPESQ = 1.31, ΔfwSegSNR = 11.9.
CITATION STYLE
Zhang, T., Geng, Y., Sun, J., Jiao, C., & Ding, B. (2020). A unified speech enhancement system based on neural beamforming with parabolic reflector. Applied Sciences (Switzerland), 10(7). https://doi.org/10.3390/app10072218
Mendeley helps you to discover research relevant for your work.