This paper presents effect of possible integrations of delta derivatives and log energy with MFCC for text-independent speaker identification. MFCC features extracted from speech signal are used to create speaker model using vector quantization. First, the effect of varying MFCC filters and centroids of vector quantization is compared. Next, MFCC scheme is combined with delta derivatives and log energy. The effect of these possible combinations is compared by varying MFCC filters and centroids of vector quantization. Among all experiments carried out on 120 speakers of TIMIT database, average identification rate of 99.58 % is achieved for 29 MFCC filters and 32 centroids of vector quantization.
CITATION STYLE
Dhonde, S. B., Chaudhari, A., & Jagade, S. M. (2017). Integration of mel-frequency cepstral coefficients with log energy and temporal derivatives for text-independent speaker identification. In Advances in Intelligent Systems and Computing (Vol. 468, pp. 791–797). Springer Verlag. https://doi.org/10.1007/978-981-10-1675-2_78
Mendeley helps you to discover research relevant for your work.