Transferring a song from one genre to another is most difficult if no instrumentation information is provided and genre is only defined by the timing and pitch of the played notes. Inspired by the CycleGAN music genre transfer presented in [2] we investigate whether recent additions to GAN training like spectral normalization and self-attention can improve transfer. Our preliminary results show that spectral normalization improves audible quality, while self-attention hurts content retention due to its non-locality. We further provide insights into genre attribution, showing that often only few notes are genre-decisive.
CITATION STYLE
Brunner, G., Moayeri, M., Richter, O., Wattenhofer, R., & Zhang, C. (2020). Neural Symbolic Music Genre Transfer Insights. In Communications in Computer and Information Science (Vol. 1168 CCIS, pp. 437–445). Springer. https://doi.org/10.1007/978-3-030-43887-6_36
Mendeley helps you to discover research relevant for your work.