Robust Arabic multi-stream speech recognition system in noisy environment

Anissa Imen Amrous; Mohamed Debyeche

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7340 LNCS 571-578

DOI: 10.1007/978-3-642-31254-0_65

3 Citations

3 Readers

Abstract

In this paper, the multi-stream combination framework is explored as a way to improve the noise robustness of automatic speech recognition (ASR) systems. The main issues in multi-stream systems are which feature representations to combine and what weight to assign to each one. Two feature streams are investigated: the MFCC features, and a set of complementary features consisting of pitch frequency, energy, and the first three formants. Empirically determined optimal weights are fixed for each stream. The multi-stream vectors are modeled by Hidden Markov Models (HMMs) with Gaussian Mixture Model (GMM) state distributions. Our ASR system is implemented using the HTK toolkit and the ARADIGIT corpus, a database of spoken Arabic words. The results show that, for highly noisy speech, the proposed multi-stream vectors lead to a significant improvement in recognition accuracy. © 2012 Springer-Verlag.
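The abstract does not give the combination formula or the weight values. As a minimal sketch, assuming the common formulation in which each stream's GMM likelihood is raised to its stream weight (equivalently, a weighted sum of per-stream log-likelihoods), the state output score could be computed as below. All function names, dimensions, parameter values, and weights are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def log_gmm(x, components):
    """Log density of a GMM; components is a list of (weight, mean, var)."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in components]
    peak = max(terms)
    # Log-sum-exp keeps the computation numerically stable.
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def multistream_log_likelihood(streams, state_gmms, stream_weights):
    """Weighted sum of per-stream GMM log-likelihoods for one HMM state."""
    return sum(
        w * log_gmm(x, gmm)
        for x, gmm, w in zip(streams, state_gmms, stream_weights)
    )

# Toy observation (assumed values): a 2-dim stand-in for the MFCC stream and a
# 5-dim complementary stream (pitch, energy, first three formants).
mfcc = [0.1, -0.2]
comp = [0.3, 0.5, -0.1, 0.2, 0.0]

# Toy per-stream GMMs for a single HMM state (assumed parameters).
gmm_mfcc = [(0.6, [0.0, 0.0], [1.0, 1.0]), (0.4, [1.0, -1.0], [0.5, 0.5])]
gmm_comp = [(1.0, [0.0] * 5, [1.0] * 5)]

# Hypothetical stream weights; the paper fixes its own values empirically.
weights = [0.7, 0.3]

score = multistream_log_likelihood([mfcc, comp], [gmm_mfcc, gmm_comp], weights)
print(f"state log-likelihood: {score:.4f}")
```

Setting a stream's weight to 0 removes its influence on the score, which is why the choice of weights matters when one stream degrades more than the other under noise.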

Citation (APA)

Amrous, A. I., & Debyeche, M. (2012). Robust Arabic multi-stream speech recognition system in noisy environment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7340 LNCS, pp. 571–578). https://doi.org/10.1007/978-3-642-31254-0_65
