Most existing automatic chord recognition systems use a chromagram in front-end processing and some sort of classifier (e.g., hidden Markov model, Gaussian mixture model (GMM), support vector machine, or other template matching technique). The vast majority of front-end algorithms derive acoustic features based on a standard short-time Fourier analysis and on mapping energy from the power spectrum, or from a constant-Q spectrum, to chroma bins. However, the accuracy of the resulting spectral representation is a crucial issue. In fact, conventional methods based on short-time Fourier analysis involve an intrinsic trade-off between time resolution and frequency resolution. This work investigates an alternative feature set based on time-frequency reassignment, which was applied in the past to speech processing tasks such as formant extraction. As shown in the following experiments, the reassigned spectrum provides a very accurate front-end for the GMM-based chord recognition system here investigated. © 2013 Khadkevich and Omologo; licensee Springer.
CITATION STYLE
Khadkevich, M., & Omologo, M. (2013). Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition. Eurasip Journal on Audio, Speech, and Music Processing, 2013(1). https://doi.org/10.1186/1687-4722-2013-15
Mendeley helps you to discover research relevant for your work.