Speaker identification system using Gaussian Mixture Model and Support Vector Machines (GMM-SVM) under noisy conditions

7Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Background: Automatic Speaker Identification (SID) systems has been a major breakthrough and crucial in many real-world applications. Methods: This work addresses the SID task based on GMM-SVM in a three stage process. Firstly, the Gammatone Frequency Cepstral Coefficients (GFCC) and Mean Hilbert Envelope Coefficients (MHEC) of the speakers are extracted. Secondly, these features are modeled using Gaussian Mixture Model (GMM), on adapting the extracted acoustic features by mean, the corresponding super vectors are found and these vectors are trained using Support Vector Machine (SVM). Finally, the actual recognition is done by feeding the super vectors of them asked noisy test utterance by Ideal Binary Mask (IBM) into SVM model and their accuracy of recognition is compared for GFCC, MHEC and RASTA-MFCC in different noisy conditions. Findings: Evaluation results show that SID performance carried out with MHEC is extensively better than the performance of other two features. Applications: Major areas that implements automatic SIDs are forensics, surveillance and audio biometrics etc.

Cite

CITATION STYLE

APA

Dhinesh Kumar, R., Balaji Ganesh, A., & Sasikala, S. (2016). Speaker identification system using Gaussian Mixture Model and Support Vector Machines (GMM-SVM) under noisy conditions. Indian Journal of Science and Technology, 9(19). https://doi.org/10.17485/ijst/2016/v9i19/93870

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free