Performance evaluation of PBDP based real-time speaker identification system with normal MFCC vs MFCC of LP residual features

Soma Khan; Joyanta Basu; Milton Samirakshma Bepari

Conference Proceedings

Performance evaluation of PBDP based real-time speaker identification system with normal MFCC vs MFCC of LP residual features

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7143 LNCS 358-366

DOI: 10.1007/978-3-642-27387-2_44

6Citations

3Readers

Get full text

Abstract

Present study compares, Mel Frequency Cepstral Coefficients (MFCC) of Linear Predictive (LP) Residuals with normal MFCC features using both VQ and GMM based speaker modeling approaches for performance evaluation of real- time Automatic Speaker Identification systems including both co-operative and non co-operative speaking scenario. Pitch Based Dynamic Pruning (PBDP) technique is applied regarding optimization of Speaker Identification process. System is trained and tested with voice samples of 62 speakers across different age groups. Residual of a signal contains information mostly about the source, which is speaker specific. Result shows that, in co-operative speaking, MFCC of LP residuals outperform normal MFCC features for both VQ and GMM based speaker modeling with an improvement of 7.6% and 6.8% in average accuracy respectively. But combined modeling of both features (source and vocal tract) is required for non co-operative speaking in real-time as it enhances the highest identification accuracy from 67.7% to 83%. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Khan, S., Basu, J., & Bepari, M. S. (2012). Performance evaluation of PBDP based real-time speaker identification system with normal MFCC vs MFCC of LP residual features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7143 LNCS, pp. 358–366). https://doi.org/10.1007/978-3-642-27387-2_44

Performance evaluation of PBDP based real-time speaker identification system with normal MFCC vs MFCC of LP residual features

Abstract

Author supplied keywords

Cite

Register to see more suggestions