Speaker recognition using gaussian mixtures models

Eric Simancas-Acevedo; Akira Kurematsu; Mariko Nakano Miyatake; Hector Perez-Meana

Conference Proceedings

Speaker recognition using gaussian mixtures models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2001) 2085 LNCS(PART 2) 287-294

DOI: 10.1007/3-540-45723-2_34

1Citations

3Readers

Get full text

Abstract

Control access to secret or personal information by using the speaker voice transmitted by long distance communication systems, such as the telephone system, requires accuracy and robustness of the identification or identity verification system, since the speech signal is distorted during the transmission process. Taking in consideration these requirements, a robust text independent speaker identifications system is proposed in which the speaker features are extracted using the Lineal Prediction Cepstral Coefficients (LPCEPSTRAL) and the Gaussian Mixture Models, which provides the features distribution and estimates the optimum model for each speaker, is used for identification. The proposed system, was evaluate using a data-base of 80 different speakers, with a pronoun phrase of 3-5s and digits in Japanese language stored during 4 months. Evaluation results show that proposed system achieves more than 90% of recognition rate. © Springer-Verlag Berlin Heidelberg 2001.

Cite

CITATION STYLE

APA

Simancas-Acevedo, E., Kurematsu, A., Miyatake, M. N., & Perez-Meana, H. (2001). Speaker recognition using gaussian mixtures models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2085 LNCS, pp. 287–294). Springer Verlag. https://doi.org/10.1007/3-540-45723-2_34

Speaker recognition using gaussian mixtures models

Abstract

Cite

Register to see more suggestions