We have come to the point that one of the important aspects of the process speech emotion recognition is the gender classification. The correct classification of gender will improve the performance of Speech Emotion Recognition (SER) system towards its robustness. Here, we are specifically referring to Gammatone Mel Frequency Cepstral Coefficient (GMFCC) as a feature extraction method that extracts features from IITKGP-SESHC dataset, which is very crucial in deciding either male or female in gender classification. The well known classifier “Deep Long Short Term Memory (Deep LSTM)” is itself an important kind of Recurrent Neural Network (RNN) that handles the long-range dependencies more efficiently than the RNNs. The GMFCC feature has to pass through the Deep LSTM and get average percent gender identification accuracy of 98.3%.
CITATION STYLE
Kumar*, S., & Yadav, J. (2019). Gender Classification for Emotional Speech using GMFCC and Deep LSTM. International Journal of Innovative Technology and Exploring Engineering, 9(2), 3923–2928. https://doi.org/10.35940/ijitee.a6109.129219
Mendeley helps you to discover research relevant for your work.