Audio features selection for automatic height estimation from speech

20 Citations
7 Readers (Mendeley)

Abstract

Aiming at the automatic estimation of a person's height from speech, we investigate the applicability of various subsets of speech features, formed by ranking the relevance and the individual quality of numerous audio features. Specifically, based on the relevance ranking of the large set of openSMILE audio descriptors, we selected subsets of different sizes and evaluated them on the height estimation task. In brief, during the speech parameterization process, every input utterance is converted to a single feature vector consisting of 6552 parameters. Next, a subset of this feature vector is fed to a support vector machine (SVM)-based regression model, which aims at the direct estimation of the height of an unknown speaker. The experimental evaluation performed on the TIMIT database demonstrated that: (i) the feature vector composed of the top-50 ranked parameters provides a good trade-off between computational demands and accuracy, and (ii) the best accuracy, in terms of mean absolute error and root mean square error, is observed for the top-200 subset. © Springer-Verlag Berlin Heidelberg 2010.
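The sketch below is a minimal, hypothetical illustration of the pipeline the abstract describes: rank features by relevance, keep a top-K subset, and regress speaker height with an SVM, reporting MAE and RMSE. It assumes scikit-learn and uses a univariate relevance score (f_regression) as a stand-in for the paper's ranking criterion; the openSMILE extraction of the 6552 descriptors and the TIMIT height labels are outside the sketch, and the synthetic X and y below are placeholders.

```python
# Hypothetical sketch: relevance-based feature selection + SVM regression
# of speaker height, evaluated with MAE and RMSE (assumes scikit-learn).
import numpy as np
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.svm import SVR
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import mean_absolute_error, mean_squared_error


def evaluate_top_k(X, y, k):
    """Rank features by a univariate relevance score, keep the top-k,
    and evaluate an SVM regressor with cross-validation."""
    model = make_pipeline(
        SelectKBest(score_func=f_regression, k=k),  # relevance ranking + selection
        SVR(kernel="rbf", C=1.0, epsilon=0.1),      # support vector regression
    )
    y_pred = cross_val_predict(model, X, y, cv=5)
    mae = mean_absolute_error(y, y_pred)
    rmse = np.sqrt(mean_squared_error(y, y_pred))
    return mae, rmse


if __name__ == "__main__":
    # Synthetic stand-ins for the real features/labels:
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 6552))        # one 6552-dim vector per utterance
    y = rng.normal(175.0, 10.0, size=200)   # speaker height in cm
    for k in (50, 200):                     # subset sizes discussed in the abstract
        mae, rmse = evaluate_top_k(X, y, k)
        print(f"top-{k}: MAE={mae:.2f} cm, RMSE={rmse:.2f} cm")
```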

Citation (APA)

Ganchev, T., Mporas, I., & Fakotakis, N. (2010). Audio features selection for automatic height estimation from speech. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6040 LNAI, pp. 81–90). https://doi.org/10.1007/978-3-642-12842-4_12
