Two-microphone binary mask speech enhancement in diffuse and directional noise fields

Roohollah Abdipour; Ahmad Akbari; Mohsen Rahmani

Journal ArticleOPEN ACCESS

Two-microphone binary mask speech enhancement in diffuse and directional noise fields

ETRI Journal (2014) 36(5) 772-782

DOI: 10.4218/etrij.14.0113.0917

1Citations

9Readers

Abstract

Two-microphone binary mask speech enhancement (2mBMSE) has been of particular interest in recent literature and has shown promising results. Current 2mBMSE systems rely on spatial cues of speech and noise sources. Although these cues are helpful for directional noise sources, they lose their efficiency in diffuse noise fields. We propose a new system that is effective in both directional and diffuse noise conditions. The system exploits two features. The first determines whether a given time-frequency (T-F) unit of the input spectrum is dominated by a diffuse or directional source. A diffuse signal is certainly a noise signal, but a directional signal could correspond to a noise or speech source. The second feature discriminates between T-F units dominated by speech or directional noise signals. Speech enhancement is performed using a binary mask, calculated based on the proposed features. In both directional and diffuse noise fields, the proposed system segregates speech T-F units with hit rates above 85%. It outperforms previous solutions in terms of signal-to-noise ratio and perceptual evaluation of speech quality improvement, especially in diffuse noise conditions.

Author supplied keywords

Cite

CITATION STYLE

APA

Abdipour, R., Akbari, A., & Rahmani, M. (2014). Two-microphone binary mask speech enhancement in diffuse and directional noise fields. ETRI Journal, 36(5), 772–782. https://doi.org/10.4218/etrij.14.0113.0917

Two-microphone binary mask speech enhancement in diffuse and directional noise fields

Abstract

Author supplied keywords

Cite

Register to see more suggestions