Audio fingerprinting has been an active research field typically used for music identification. Robust audio fingerprinting technology is used to successfully perform content-based audio identification regardless of the audio signal being subjected to various types of distortion. These distortions affect the time-frequency correlation relating to pitch and speed changes. In this paper, experiments are done using the computer vision technique ORB (Oriented FAST and Rotated BRIEF) for robust audio identification. Investigations are conducted for ORB, relating to its advantage of robustness against distortions including speed and pitch changes. The ORB prototype compares the features of the spectrogram image query to a database of spectrogram images of the songs. For the initial experiment, a Brute-Force matcher is used to compare the ORB descriptors. Results show that the ORB prototype performs robustly to real-world distortions with fast, reliable performance against distortions such as speed and pitch which justifies the research done.
CITATION STYLE
Williams, D., Pooransingh, A., & Saitoo, J. (2017). Efficient music identification using ORB descriptors of the spectrogram image. Eurasip Journal on Audio, Speech, and Music Processing, 2017(1). https://doi.org/10.1186/s13636-017-0114-4
Mendeley helps you to discover research relevant for your work.