Double Fourier analysis for Emotion Identification in Voiced Speech

D. Sierra-Sosa; M. Bastidas; P. D. Ortiz; O. L. Quintero

Conference ProceedingsOPEN ACCESS

Double Fourier analysis for Emotion Identification in Voiced Speech

Journal of Physics: Conference Series (2016) 705(1)

DOI: 10.1088/1742-6596/705/1/012035

4Citations

18Readers

Abstract

We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented.

Cite

CITATION STYLE

APA

Sierra-Sosa, D., Bastidas, M., Ortiz, P. D., & Quintero, O. L. (2016). Double Fourier analysis for Emotion Identification in Voiced Speech. In Journal of Physics: Conference Series (Vol. 705). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/705/1/012035

Double Fourier analysis for Emotion Identification in Voiced Speech

Abstract

Cite

Register to see more suggestions