Abstract
De-essing is the process of attenuating vocal sibilance in audio recordings. Especially in audio mastering, conventional de-essers often degrade the clarity of the source signal due to unreliable differentiation between vocal sibilance and other high-pitched sounds. Machine learning poses a promising solution to this problem. In this context, a new de-essing approach based on a convolutional neural network architecture is presented. The introduced prototype de-esser outperforms existing de-esser plugins in terms of erroneous signal attenuation and was rated favorably by audio professionals.
Author supplied keywords
Cite
CITATION STYLE
Hestermann, S., & Deffner, N. (2020). Enhanced De-Essing via Neural Networks. In Communications in Computer and Information Science (Vol. 1168 CCIS, pp. 537–542). Springer. https://doi.org/10.1007/978-3-030-43887-6_49
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.