Neural-response-based text-dependent speaker identification under noisy conditions

3Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Speaker identification is a technique of determining an unknown speaker’s identity and is very essential for security, crime investigation, and telephoning. In this study, a text-dependent speaker identification technique using the neural responses of a physiologically-based computational model of the auditory periphery is proposed. Neurograms were constructed from the responses of the auditory-nerve model to sentences of different speakers. The proposed features were then used to train and test the recognition system using the support vector machine and Gaussian mixture model classification techniques. The proposed method was tested on a textdependent database in quiet and under noisy conditions for a range of signal-to-noise ratios. Although the performance of the proposed method in quiet was comparable to the performance of a traditional Mel frequency cepstral coefficientsbased method and also to the result of a very recent Gammatone frequency cepstral coefficient-based system, the neuralresponse- based method showed a substantially better classification accuracy under noisy conditions. The proposed method could be extended to design a text-independent speaker identification and verification system.

Cite

CITATION STYLE

APA

Islam, M. A., Zilany, M. S. A., & Jassim, A. J. (2016). Neural-response-based text-dependent speaker identification under noisy conditions. In IFMBE Proceedings (Vol. 56, pp. 11–14). Springer Verlag. https://doi.org/10.1007/978-981-10-0266-3_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free