Speech/music classification using wavelet based feature extraction techniques

18Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.

Abstract

Audio classification serves as the fundamental step towards the rapid growth in audio data volume. Due to the increasing size of the multimedia sources speech and music classification is one of the most important issues for multimedia information retrieval. In this work a speech/music discrimination system is developed which utilizes the Discrete Wavelet Transform (DWT) as the acoustic feature. Multi resolution analysis is the most significant statistical way to extract the features from the input signal and in this study, a method is deployed to model the extracted wavelet feature. Support Vector Machines (SVM) are based on the principle of structural risk minimization. SVM is applied to classify audio into their classes namely speech and music, by learning from training data. Then the proposed method extends the application of Gaussian Mixture Models (GMM) to estimate the probability density function using maximum likelihood decision methods. The system shows significant results with an accuracy of 94.5%. © 2014 Science Publications.

Cite

CITATION STYLE

APA

Ramalingam, T., & Dhanalakshmi, P. (2013). Speech/music classification using wavelet based feature extraction techniques. Journal of Computer Science, 10(1), 34–44. https://doi.org/10.3844/jcssp.2014.34.44

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free