This paper presents an automatic speech-based classification scheme to classify speaker characteristics. In the training phase, speech data are grouped into speaker groups according to speakers' gender, age and accent. Voice features are then extracted to feature vectors which are used to train speaker characteristic models with different techniques which are Vector Quantization, Gaussian Mixture Model and Support Vector Machine. Fusion of classification results from those groups is then performed to obtain final classification results for each characteristic. The Australian National Database of Spoken Language (ANDOSL) corpus was used for evaluation of gender, age and accent classification. Experiments showed high performance for the proposed classification scheme. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Nguyen, P., Tran, D., Huang, X., & Sharma, D. (2010). Automatic speech-based classification of gender, age and accent. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6232 LNAI, pp. 288–299). https://doi.org/10.1007/978-3-642-15037-1_24
Mendeley helps you to discover research relevant for your work.