Speaker recognition using gaussian mixtures models

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Control access to secret or personal information by using the speaker voice transmitted by long distance communication systems, such as the telephone system, requires accuracy and robustness of the identification or identity verification system, since the speech signal is distorted during the transmission process. Taking in consideration these requirements, a robust text independent speaker identifications system is proposed in which the speaker features are extracted using the Lineal Prediction Cepstral Coefficients (LPCEPSTRAL) and the Gaussian Mixture Models, which provides the features distribution and estimates the optimum model for each speaker, is used for identification. The proposed system, was evaluate using a data-base of 80 different speakers, with a pronoun phrase of 3-5s and digits in Japanese language stored during 4 months. Evaluation results show that proposed system achieves more than 90% of recognition rate. © Springer-Verlag Berlin Heidelberg 2001.

Cite

CITATION STYLE

APA

Simancas-Acevedo, E., Kurematsu, A., Miyatake, M. N., & Perez-Meana, H. (2001). Speaker recognition using gaussian mixtures models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2085 LNCS, pp. 287–294). Springer Verlag. https://doi.org/10.1007/3-540-45723-2_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free