Speech Emotion Recognition Using openSMILE and GPT 3.5 Transformer

3Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.

Abstract

In recent years, artificial intelligence, and machine learning (ML) models have advanced significantly, offering transformative solutions across diverse sectors. Emotion recognition in speech has particularly benefited from ML techniques, revolutionizing its accuracy and applicability. This article proposes a method for emotion detection in Romanian speech analysis by combining two distinct approaches: semantic analysis using GPT Transformer and acoustic analysis using openSMILE. The results showed an accuracy of 74% and a precision of almost 82%. Several system limitations were observed due to the limited and low-quality dataset. However, it also opened a new horizon in our research by analyzing emotions to identify mental health disorders.

Cite

CITATION STYLE

APA

Turcian, D., & Stoicu-Tivadar, V. (2024). Speech Emotion Recognition Using openSMILE and GPT 3.5 Transformer. In Studies in Health Technology and Informatics (Vol. 316, pp. 924–928). IOS Press BV. https://doi.org/10.3233/SHTI240562

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free