Semour: A scripted emotional speech repository for urdu

Nimra Zaheer; Obaid Ullah Ahmad; Ammar Ahmed; Muhammad Shehryar Khan; Mudassir Shabbir

Conference ProceedingsOPEN ACCESS

Semour: A scripted emotional speech repository for urdu

Conference on Human Factors in Computing Systems - Proceedings (2021)

DOI: 10.1145/3411764.3445171

7Citations

32Readers

Get full text

Abstract

Designing reliable Speech Emotion Recognition systems is a complex task that inevitably requires sufcient data for training purposes. Such extensive datasets are currently available in only a few languages, including English, German, and Italian. In this paper, we present SEMOUR, the frst scripted database of emotion-tagged speech in the Urdu language, to design an Urdu Speech Recognition System. Our gender-balanced dataset contains 15, 040 unique instances recorded by eight professional actors eliciting a syntactically complex script. The dataset is phonetically balanced, and reliably exhibits a varied set of emotions as marked by the high agreement scores among human raters in experiments. We also provide various baseline speech emotion prediction scores on the database, which could be used for various applications like personalized robot assistants, diagnosis of psychological disorders, and getting feedback from a low-tech-enabled population, etc. On a random test sample, our model correctly predicts an emotion with a state-of-the-art 92% accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Zaheer, N., Ahmad, O. U., Ahmed, A., Khan, M. S., & Shabbir, M. (2021). Semour: A scripted emotional speech repository for urdu. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https://doi.org/10.1145/3411764.3445171

Semour: A scripted emotional speech repository for urdu

Abstract

Author supplied keywords

Cite

Register to see more suggestions