Non-linear speech feature extraction for phoneme classification and speaker recognition

Mohamed Chetouani; Marcos Faundez-Zanuy; Bruno Gas; Jean Luc Zarader

Conference Proceedings

Non-linear speech feature extraction for phoneme classification and speaker recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3445 LNAI 344-350

DOI: 10.1007/11520153_16

7Citations

5Readers

Get full text

Abstract

In this paper we propose a new feature extraction algorithm based on non-linear prediction: the Neural Predictive Coding (NPC) model which is an extension of the classical LPC one. We apply this model to two significant tasks: phoneme classification and speaker identification. For the first one, the NPC model is trained with a Minimum Classification Error (MCE) criterion. The experiments carried out with the NTIMIT database show an improvement of the classification rates. For speaker identification, we propose a new feature extraction principle based on the NPC model. We also investigate different initialization methods. The new method gives better performances than the traditional ones (LPC, MFCC and PLP). © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Chetouani, M., Faundez-Zanuy, M., Gas, B., & Zarader, J. L. (2005). Non-linear speech feature extraction for phoneme classification and speaker recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3445 LNAI, pp. 344–350). Springer Verlag. https://doi.org/10.1007/11520153_16

Non-linear speech feature extraction for phoneme classification and speaker recognition

Abstract

Cite

Register to see more suggestions