In this paper, various temporal features (i.e., zero crossing rate and short-time energy) and spectral features (spectral flux and spectral centroid) have been derived from the Teager energy operator (TEO) profile of the speech waveform. The efficacy of these features has been analyzed for the classification of normal and dysphonic voices by comparing their performance with the features derived from the linear prediction (LP) residual and the speech waveform. In addition, the effectiveness of fusing these features with state-of-the-art Mel frequency cepstral coefficients (MFCC) feature-set has also been investigated to understand whether these features provide complementary results. The classifier that has been used is the 2nd order polynomial classifier, with experiments being carried out on a subset of the Massachusetts Eye and Ear Infirmary (MEEI) database. © 2012 Springer-Verlag GmbH Berlin Heidelberg.
CITATION STYLE
Patil, H. A., Baljekar, P. N., & Basu, T. K. (2012). Novel temporal and spectral features derived from TEO for classification normal and dysphonic voices. In Advances in Intelligent and Soft Computing (Vol. 133 AISC, pp. 559–567). https://doi.org/10.1007/978-3-642-27552-4_76
Mendeley helps you to discover research relevant for your work.