Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario

Abstract

Speech is an important modality in serious game platforms, and serious games can be very useful for rehabilitating individuals with voice disorders. Such applications therefore need an efficient, high-performance automatic speech recognition (ASR) system. In this paper, we propose a spectro-temporal directional derivative (STDD) feature that requires fewer computations in modeling yet gives high recognition accuracy in the ASR system. The STDD feature is obtained by applying different directional derivative filters in the spectro-temporal domain; the feature dimension is then compressed by a discrete cosine transform. Experiments are performed with voice samples of Arabic numerals spoken by persons with and without voice pathology. The results show that the STDD feature outperforms conventional mel-frequency cepstral coefficients in both clean and noisy environments.
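The abstract does not specify the filters or dimensions, so the following is only a minimal sketch of the general idea (directional derivative filtering of a time-frequency representation followed by DCT compression), not the authors' implementation. The 3x3 Sobel-style kernels, the four directions, the function names (`filter2d`, `dct_matrix`, `stdd_like_features`), and the choice of `n_coeffs` are all assumptions made for illustration.

```python
import numpy as np

# ASSUMPTION: Sobel-style 3x3 kernels in four directions. The paper's
# actual directional derivative filters are not given in the abstract.
KERNELS = {
    "time": np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float),
    "freq": np.array([[-1, -2, -1], [0, 0, 0], [1, 2, 1]], dtype=float),
    "diag_up": np.array([[0, 1, 2], [-1, 0, 1], [-2, -1, 0]], dtype=float),
    "diag_down": np.array([[-2, -1, 0], [-1, 0, 1], [0, 1, 2]], dtype=float),
}


def filter2d(spec, kernel):
    """Same-size 2-D cross-correlation with edge-replicated padding."""
    kh, kw = kernel.shape
    padded = np.pad(spec, ((kh // 2, kh // 2), (kw // 2, kw // 2)), mode="edge")
    out = np.zeros(spec.shape, dtype=float)
    for i in range(kh):
        for j in range(kw):
            out += kernel[i, j] * padded[i:i + spec.shape[0], j:j + spec.shape[1]]
    return out


def dct_matrix(n_out, n_in):
    """First n_out rows of the orthonormal DCT-II matrix for length-n_in input."""
    k = np.arange(n_out)[:, None]
    n = np.arange(n_in)[None, :]
    mat = np.sqrt(2.0 / n_in) * np.cos(np.pi * k * (2 * n + 1) / (2 * n_in))
    mat[0] /= np.sqrt(2.0)
    return mat


def stdd_like_features(log_spec, n_coeffs=6):
    """Map a (n_bands, n_frames) log spectrogram to (n_frames, 4 * n_coeffs).

    Each directional derivative map is compressed along the frequency axis
    by keeping the first n_coeffs DCT coefficients (a hypothetical choice).
    """
    dct = dct_matrix(n_coeffs, log_spec.shape[0])
    parts = [(dct @ filter2d(log_spec, kernel)).T for kernel in KERNELS.values()]
    return np.concatenate(parts, axis=1)
```

In this sketch the input would typically be a log mel spectrogram with frequency bands along the first axis and frames along the second; the per-frame output vectors could then be passed to whatever acoustic model the recognizer uses.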

Cite

Muhammad, G., Masud, M., Alelaiwi, A., Rahman, M. A., Karime, A., Alamri, A., & Hossain, M. S. (2015). Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario. Multimedia Tools and Applications, 74(14), 5313–5327. https://doi.org/10.1007/s11042-014-1973-7
