Synthesizer is a type of electronic musical instrument that is now widely used in modern music production and sound design. Each parameter configuration of a synthesizer produces a unique timbre and can be viewed as a unique instrument. The problem of estimating a set of parameters configuration that best restore a sound timbre is an important yet complicated problem, i.e.: the synthesizer parameters estimation problem. We proposed a multi-modal deep-learning-based pipeline SOUND2SYNTH, together with a network structure Prime-Dilated Convolution (PDC) specially designed to solve this problem. Our method achieved not only SOTA but also the first real-world applicable results on the Dexed synthesizer, a popular FM synthesizer.
CITATION STYLE
Chen, Z., Jing, Y., Yuan, S., Xu, Y., Wu, J., & Zhao, H. (2022). SOUND2SYNTH: Interpreting Sound via FM Synthesizer Parameters Estimation. In IJCAI International Joint Conference on Artificial Intelligence (pp. 4921–4928). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2022/682
Mendeley helps you to discover research relevant for your work.