Music Generation Based on Convolution-LSTM

  • Huang Y
  • Huang X
  • Cai Q
Citations: N/A
Readers: 18 (Mendeley users who have this article in their library)

Abstract

In this paper, we propose a model that combines a Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) for music generation. We first convert MIDI-format music files into musical score matrices, and then apply convolution layers to extract features from each score matrix. Finally, the output of the convolution layers is split along the time axis and fed into the LSTM to generate music. The model was evaluated through accuracy comparison, time-domain analysis, frequency-domain analysis, and human auditory evaluation. The results show that Convolution-LSTM performs better in music generation than LSTM alone, producing more pronounced melodic undulations and a clearer melody.
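The abstract describes a pipeline of score-matrix input, convolutional feature extraction, a time-axis split, and an LSTM. A minimal sketch of that architecture is given below, assuming a piano-roll style score matrix (128 pitches by T time steps); the layer counts, channel widths, and hidden size are illustrative assumptions, not values taken from the paper.

```python
import torch
import torch.nn as nn


class ConvLSTMComposer(nn.Module):
    """Hypothetical Convolution-LSTM sketch: convolution layers extract
    features from a piano-roll score matrix, then the feature map is split
    along the time axis and fed to an LSTM. All sizes are assumptions."""

    def __init__(self, n_pitches=128, hidden=256):
        super().__init__()
        # Convolution layers over the score matrix
        # (input shape: batch x 1 x n_pitches x time_steps).
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
        )
        # The LSTM consumes one flattened feature vector per time step.
        self.lstm = nn.LSTM(32 * n_pitches, hidden, batch_first=True)
        # Project the hidden state back to per-pitch note probabilities.
        self.head = nn.Linear(hidden, n_pitches)

    def forward(self, score):                 # (batch, 1, pitches, time)
        feats = self.conv(score)              # (batch, 32, pitches, time)
        # Split along the time axis: put time first, then flatten the
        # channel and pitch dimensions into one vector per time step.
        feats = feats.permute(0, 3, 1, 2)     # (batch, time, 32, pitches)
        feats = feats.flatten(2)              # (batch, time, 32 * pitches)
        out, _ = self.lstm(feats)             # (batch, time, hidden)
        return torch.sigmoid(self.head(out))  # next-step note probabilities


# Usage with a dummy batch; a real score matrix could come from, e.g.,
# pretty_midi.PrettyMIDI("song.mid").get_piano_roll(fs=4) (128 x T array).
model = ConvLSTMComposer()
roll = torch.rand(2, 1, 128, 64)   # 2 pieces, 128 pitches, 64 time steps
probs = model(roll)                # (2, 64, 128)
```

The permute-and-flatten step is one plausible reading of "split in the direction of the time axis": each column of the convolutional feature map becomes one LSTM input, so the recurrent layer models temporal structure over the extracted features.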

Citation (APA)

Huang, Y., Huang, X., & Cai, Q. (2018). Music Generation Based on Convolution-LSTM. Computer and Information Science, 11(3), 50. https://doi.org/10.5539/cis.v11n3p50
