Self-Supervised Contrastive Learning for Robust Audio-Sheet Music Retrieval Systems

5Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Linking sheet music images to audio recordings remains a key problem for the development of efficient cross-modal music retrieval systems. One of the fundamental approaches toward this task is to learn a cross-modal embedding space via deep neural networks that is able to connect short snippets of audio and sheet music. However, the scarcity of annotated data from real musical content affects the capability of such methods to generalize to real retrieval scenarios. In this work, we investigate whether we can mitigate this limitation with self-supervised contrastive learning, by exposing a network to a large amount of real music data as a pre-Training step, by contrasting randomly augmented views of snippets of both modalities, namely audio and sheet images. Through a number of experiments on synthetic and real piano data, we show that pretrained models are able to retrieve snippets with better precision in all scenarios and pre-Training configurations. Encouraged by these results, we employ the snippet embeddings in the higher-level task of cross-modal piece identification and conduct more experiments on several retrieval configurations. In this task, we observe that the retrieval quality improves from 30% up to 100% when real music data is present. We then conclude by arguing for the potential of self-supervised contrastive learning for alleviating the annotated data scarcity in multi-modal music retrieval models. Code and trained models are accessible at https://github.com/luisfvc/ucasr.

Cite

CITATION STYLE

APA

Carvalho, L., Washüttl, T., & Widmer, G. (2023). Self-Supervised Contrastive Learning for Robust Audio-Sheet Music Retrieval Systems. In MMSys 2023 - Proceedings of the 14th ACM Multimedia Systems Conference (pp. 239–248). Association for Computing Machinery, Inc. https://doi.org/10.1145/3587819.3590968

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free