Emotion classification based on speaking rate

Shashidhar G. Koolagudi; Sudhin Ray; K. Sreenivasa Rao

Conference Proceedings

Emotion classification based on speaking rate

Communications in Computer and Information Science (2010) 94 CCIS(PART 1) 316-327

DOI: 10.1007/978-3-642-14834-7_30

5Citations

3Readers

Get full text

Abstract

In this paper, vocal tract characteristics related to speaking rate are explored to categorise the emotions. The emotions considered are anger, disgust, fear, happy, neutral, sadness, sarcastic and surprise. These emotions are grouped into 3 broad categories namely normal, fast and slow based on speaking rate. Mel frequency cepstral coefficients (MFCC's) are used as features and Gaussian Mixture Models are used for developing the emotion classification models. The basic hypothesis is that the sequence of vocal tract shapes in producing the speech for the given utterance is unique with respect to the speaking rate. The overall classification performance of emotions using speaking rate is observed to be 91% in case of single female utterances. © 2010 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Koolagudi, S. G., Ray, S., & Sreenivasa Rao, K. (2010). Emotion classification based on speaking rate. In Communications in Computer and Information Science (Vol. 94 CCIS, pp. 316–327). https://doi.org/10.1007/978-3-642-14834-7_30

Emotion classification based on speaking rate

Abstract

Author supplied keywords

Cite

Register to see more suggestions