Abstract
Recent studies on computational speech segregation reported improved speech intelligibility in noise when estimating and applying an ideal binary mask with supervised learning algorithms. However, an important requirement for such systems in technical applications is their robustness to acoustic conditions not considered during training. This study demonstrates that the spectro-temporal noise variations that occur during training and testing determine the achievable segregation performance. In particular, such variations strongly affect the identification of acoustical features in the system associated with perceptual attributes in speech segregation. The results could help establish a framework for a systematic evaluation of future segregation systems.
Cite
CITATION STYLE
May, T., & Dau, T. (2014). Requirements for the evaluation of computational speech segregation systems. The Journal of the Acoustical Society of America, 136(6), EL398–EL404. https://doi.org/10.1121/1.4901133
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.