An analysis of emotional speech recognition for Tamil language using deep learning gate recurrent unit

Bennilo Fernandes; Kasiprasad Mannepalli

Journal Article, Open Access

Pertanika Journal of Science and Technology (2021) 29(3) 1937-1961

DOI: 10.47836/pjst.29.3.37

Abstract

Designing the interaction between human language and a registered emotional database makes it possible to explore how a system performs, and offers multiple approaches to emotion detection in patient services. Until now, clustering techniques have been the primary choice in many prominent areas, including emotional speech recognition. Although they give good results, this paper presents a new design that uses Long Short-Term Memory (LSTM), Bi-Directional LSTM (BiLSTM) and Gated Recurrent Unit (GRU) networks as the estimation method for emotional Tamil datasets. A new Deep Hierarchical LSTM/BiLSTM/GRU layer arrangement is designed to obtain the best result on voice datasets that require long-term learning. Several combinations of deep hierarchical architectures are designed, namely LSTM & GRU (DHLG), BiLSTM & GRU (DHBG), GRU & LSTM (DHGL), GRU & BiLSTM (DHGB) and dual GRU (DHGG), each with a dropout layer introduced to overcome learning problems and vanishing gradients in emotional speech recognition. Moreover, various combinations of feature extraction are used to improve the outcome for each emotional speech signal. In the analysis, the proposed DHGB model gives an average classification validity of 82.86%, slightly higher than the other models: DHGL (82.58%), DHBG (82%), DHLG (81.14%) and DHGG (80%). Thus, comparing all the models, DHGB gives a prominent outcome of 5% more than the other four models, with minimum training time and a small dataset.
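
The abstract names the GRU as a building block but does not give its equations. As a rough illustration only, the sketch below implements one common textbook formulation of a single GRU step (update gate, reset gate, candidate state) in plain Python and runs it over a short sequence of feature frames. It is not the paper's DHGB architecture: the parameter names (`Wz`, `Uz`, `bz`, and so on), the helper functions, and the toy sizes (3 input features, 2 hidden units) are all assumptions made for the example.

```python
import math
from typing import Dict, List

Vector = List[float]
Matrix = List[List[float]]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def affine(W: Matrix, x: Vector, U: Matrix, h: Vector, b: Vector) -> Vector:
    """Compute W @ x + U @ h + b using plain lists."""
    return [
        sum(w * xi for w, xi in zip(W[i], x))
        + sum(u * hi for u, hi in zip(U[i], h))
        + b[i]
        for i in range(len(b))
    ]


def gru_step(x: Vector, h: Vector, p: Dict[str, list]) -> Vector:
    """One GRU time step (assumed convention: h' = (1 - z) * h + z * h_cand)."""
    # Update gate: how much of the candidate state replaces the old state.
    z = [sigmoid(v) for v in affine(p["Wz"], x, p["Uz"], h, p["bz"])]
    # Reset gate: how much of the old state feeds the candidate.
    r = [sigmoid(v) for v in affine(p["Wr"], x, p["Ur"], h, p["br"])]
    gated_h = [ri * hi for ri, hi in zip(r, h)]
    # Candidate state computed from the input and the reset-gated old state.
    h_cand = [math.tanh(v) for v in affine(p["Wh"], x, p["Uh"], gated_h, p["bh"])]
    return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, h_cand)]


def run_gru(frames: List[Vector], p: Dict[str, list], hidden_size: int) -> Vector:
    """Feed a sequence of feature frames through the cell; return the last state."""
    h = [0.0] * hidden_size
    for x in frames:
        h = gru_step(x, h, p)
    return h


def zero_params(input_size: int, hidden_size: int) -> Dict[str, list]:
    """Hypothetical helper: all-zero weights, handy for checking the gating logic."""
    p: Dict[str, list] = {}
    for gate in ("z", "r", "h"):
        p["W" + gate] = [[0.0] * input_size for _ in range(hidden_size)]
        p["U" + gate] = [[0.0] * hidden_size for _ in range(hidden_size)]
        p["b" + gate] = [0.0] * hidden_size
    return p
```

With all-zero weights both gates evaluate to 0.5 and the candidate state is 0, so a single step simply halves the previous hidden state, which gives a quick sanity check of the gating arithmetic. In practice the weights would be learned, and the hierarchical models named in the abstract would stack such recurrent layers with dropout between them.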

Fernandes, B., & Mannepalli, K. (2021). An analysis of emotional speech recognition for Tamil language using deep learning gate recurrent unit. Pertanika Journal of Science and Technology, 29(3), 1937–1961. https://doi.org/10.47836/pjst.29.3.37