Application of vector quantization in emotion recognition from human speech

Abstract

Recognizing emotions from speech is a complex task, further complicated by the fact that there is no unambiguous answer to what the "correct" emotion is for a given speech sample. In this paper, we discuss emotion classification on a well-known German database covering 6 basic emotions (sadness, boredom, neutral, fear, happiness, and anger) using Mel-frequency cepstral coefficients (MFCCs). A concern with MFCCs is the large number of features they produce. We discuss the use of the LBG-VQ algorithm to reduce the amount of data to be handled. Finally, emotion classification is done using the Euclidean, Manhattan, and Chebyshev distances between the codebook of the neutral state and the codebooks of the other emotional states for the same sample. © 2011 Springer-Verlag.
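
The pipeline outlined in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `lbg_codebook` and `codebook_distances` are hypothetical, the feature matrix is assumed to hold one MFCC vector per row, the splitting step uses a small additive perturbation with a fixed number of refinement passes instead of a distortion threshold, and the two codebooks are compared entry by entry, which the abstract does not specify.

```python
import numpy as np

def lbg_codebook(features, size=4, eps=0.01, iters=20):
    """LBG-style codebook training by repeated splitting (size: a power of two)."""
    features = np.asarray(features, dtype=float)
    # Start from a single codeword: the centroid of all feature vectors.
    codebook = features.mean(axis=0, keepdims=True)
    while codebook.shape[0] < size:
        # Split every codeword into a slightly perturbed pair (assumed additive split).
        codebook = np.vstack([codebook + eps, codebook - eps])
        for _ in range(iters):
            # Assign each feature vector to its nearest codeword (Euclidean).
            dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
            nearest = dists.argmin(axis=1)
            # Move each codeword to the centroid of the vectors assigned to it.
            for k in range(codebook.shape[0]):
                members = features[nearest == k]
                if len(members):
                    codebook[k] = members.mean(axis=0)
    return codebook

def codebook_distances(a, b):
    """Euclidean, Manhattan, and Chebyshev distances between two equal-shape codebooks."""
    diff = np.abs(np.asarray(a, dtype=float) - np.asarray(b, dtype=float)).ravel()
    return {
        "euclidean": float(np.sqrt((diff ** 2).sum())),
        "manhattan": float(diff.sum()),
        "chebyshev": float(diff.max()),
    }
```

In use, one codebook would be trained on the MFCC frames of the neutral utterance and one on each emotional utterance of the same sample, and `codebook_distances` would then be applied to each neutral/emotional pair. Because codewords are compared by position, this sketch assumes the two codebooks have a comparable ordering; a real system may need to align codewords first.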

Cite

Khanna, P., & Sasi Kumar, M. (2011). Application of vector quantization in emotion recognition from human speech. In Communications in Computer and Information Science (Vol. 141 CCIS, pp. 118–125). https://doi.org/10.1007/978-3-642-19423-8_13
