An analysis of emotional speech recognition for tamil language using deep learning gate recurrent unit

Abstract

Designing the interaction between human language and a registered emotional database makes it possible to explore how a system performs and supports multiple approaches to emotion detection in patient services. Until now, clustering techniques have been the primary choice in many prominent areas, including emotional speech recognition. Although they show good results, this paper presents a new design approach that uses Long Short-Term Memory (LSTM), Bi-Directional LSTM (BiLSTM) and Gated Recurrent Unit (GRU) networks as estimation methods for emotional Tamil datasets. A new Deep Hierarchical LSTM/BiLSTM/GRU layer arrangement is designed to obtain the best result for long-term learning on a voice dataset. Different combinations of the deep hierarchical architecture, namely LSTM & GRU (DHLG), BiLSTM & GRU (DHBG), GRU & LSTM (DHGL), GRU & BiLSTM (DHGB) and dual GRU (DHGG) layers, are designed, with a dropout layer introduced to overcome the learning problem and vanishing-gradient issues in emotional speech recognition. Moreover, to improve the outcome for each emotional speech signal, various combinations of feature extraction are used. In the analysis, the proposed DHGB model gives an average classification accuracy of 82.86%, slightly higher than the other models: DHGL (82.58%), DHBG (82%), DHLG (81.14%) and DHGG (80%). Comparing all the models, DHGB thus gives the most prominent outcome, reported as 5% more than the other four models, with minimum training time and a small dataset.
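The gaps between the reported averages can be worked out directly. The following is a minimal sketch in Python that uses only the five percentages quoted in the abstract; the dictionary and function names are hypothetical and chosen for illustration, and nothing here reproduces the authors' experiments.

```python
# Average classification figures (%) as quoted in the abstract.
# No other data from the paper is assumed.
reported_accuracy = {
    "DHGB": 82.86,  # GRU & BiLSTM
    "DHGL": 82.58,  # GRU & LSTM
    "DHBG": 82.00,  # BiLSTM & GRU
    "DHLG": 81.14,  # LSTM & GRU
    "DHGG": 80.00,  # dual GRU
}


def margins_over_others(scores, best="DHGB"):
    """Return the gap, in percentage points, between `best` and each other model."""
    return {
        name: round(scores[best] - value, 2)
        for name, value in scores.items()
        if name != best
    }


margins = margins_over_others(reported_accuracy)

# Print the gaps from smallest to largest.
for name, gap in sorted(margins.items(), key=lambda item: item[1]):
    print(f"DHGB - {name}: {gap:.2f} percentage points")
```

On these averages the gaps range from 0.28 to 2.86 percentage points. They describe only the quoted averages; the abstract does not say how its 5% figure is measured, so it may rest on a different comparison.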

Cite

Fernandes, B., & Mannepalli, K. (2021). An analysis of emotional speech recognition for tamil language using deep learning gate recurrent unit. Pertanika Journal of Science and Technology, 29(3), 1937–1961. https://doi.org/10.47836/pjst.29.3.37
