In this paper a convolutional neural network is applied to the problem of note onset detection in audio recordings. Two time-frequency representations are analysed, showing the superiority of standard spectrogram over enhanced autocorrelation (EAC) used as the input to the convolutional network. Experimental evaluation is based on a dataset containing 10,939 annotated onsets, with total duration of the audio recordings of over 45 min.
CITATION STYLE
Stasiak, B., & Monko, J. (2016). Analysis of time-frequency representations for musical onset detection with convolutional neural network. In Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, FedCSIS 2016 (pp. 147–152). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.15439/2016F558
Mendeley helps you to discover research relevant for your work.