Double Fourier analysis for Emotion Identification in Voiced Speech

4Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented.

Cite

CITATION STYLE

APA

Sierra-Sosa, D., Bastidas, M., Ortiz, P. D., & Quintero, O. L. (2016). Double Fourier analysis for Emotion Identification in Voiced Speech. In Journal of Physics: Conference Series (Vol. 705). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/705/1/012035

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free