Control access to secret or personal information by using the speaker voice transmitted by long distance communication systems, such as the telephone system, requires accuracy and robustness of the identification or identity verification system, since the speech signal is distorted during the transmission process. Taking in consideration these requirements, a robust text independent speaker identifications system is proposed in which the speaker features are extracted using the Lineal Prediction Cepstral Coefficients (LPCEPSTRAL) and the Gaussian Mixture Models, which provides the features distribution and estimates the optimum model for each speaker, is used for identification. The proposed system, was evaluate using a data-base of 80 different speakers, with a pronoun phrase of 3-5s and digits in Japanese language stored during 4 months. Evaluation results show that proposed system achieves more than 90% of recognition rate. © Springer-Verlag Berlin Heidelberg 2001.
CITATION STYLE
Simancas-Acevedo, E., Kurematsu, A., Miyatake, M. N., & Perez-Meana, H. (2001). Speaker recognition using gaussian mixtures models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2085 LNCS, pp. 287–294). Springer Verlag. https://doi.org/10.1007/3-540-45723-2_34
Mendeley helps you to discover research relevant for your work.