Emotion detection from speech to enrich multimedia content

71 citations · 72 Mendeley readers

Abstract

This paper describes an experimental study on the detection of emotion from speech. As computer-based characters such as avatars and virtual chat faces become more common, using emotion to drive the expression of virtual characters becomes more important. The study uses a corpus of 721 short emotional utterances, captured manually from movies and teleplays, expressing four emotions: anger, happiness, sadness, and the neutral (unemotional) state. We introduce a new concept for evaluating emotion in speech: emotions are so complex that most spoken sentences cannot be assigned precisely to a single emotion category, yet most emotional states can be described as a mixture of multiple emotions. Based on this concept, we trained support vector machines (SVMs) to recognize utterances in these four categories and developed an agent that can both recognize and express emotions.
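The abstract does not publish the paper's acoustic feature set or training code, so the following is only a rough illustration of the idea it describes: an SVM classifier over the four emotion categories whose per-class scores can be read as a mixture of emotions rather than a single hard label. The feature vectors here are synthetic stand-ins, and the class names are the four categories from the abstract; everything else is assumed.

```python
# Hypothetical sketch, not the authors' system: random "acoustic feature"
# vectors stand in for the real pitch/energy-style features.
import numpy as np
from sklearn.svm import SVC

EMOTIONS = ["anger", "happiness", "sadness", "neutral"]

rng = np.random.default_rng(0)
n_per_class, n_features = 40, 12
# One Gaussian cluster of synthetic feature vectors per emotion class.
X = np.vstack([
    rng.normal(loc=i, scale=1.0, size=(n_per_class, n_features))
    for i in range(len(EMOTIONS))
])
y = np.repeat(np.arange(len(EMOTIONS)), n_per_class)

# probability=True makes the SVM emit per-class scores, matching the
# paper's view of an utterance as a mixture of multiple emotions.
clf = SVC(kernel="rbf", probability=True, random_state=0).fit(X, y)

probs = clf.predict_proba(X[:1])[0]
mixture = dict(zip(EMOTIONS, probs))  # e.g. soft weights over 4 emotions
```

A hard label, if one is needed, is just the argmax of the mixture; the soft scores are what would drive graded expression in a virtual character.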

Citation (APA)

Yu, F., Chang, E., Xu, Y. Q., & Shum, H. Y. (2001). Emotion detection from speech to enrich multimedia content. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2195, pp. 550–557). Springer Verlag. https://doi.org/10.1007/3-540-45453-5_71
