Robust multi-band asr using deep neural nets and spectro-temporal features

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Spectro-temporal feature extraction and multi-band processing were both designed to make the speech recognizers more robust. Although they have been used for a long time now, very few attempts have been made to combine them. This is why here we integrate two spectro-temporal feature extraction methods into a multi-band framework. We assess the performance of our spectro-temporal feature sets both individually (as a baseline) and in combination with multi-band processing in phone recognition tasks on clean and noise contaminated versions of the TIMIT dataset. Our results show that multi-band processing clearly outperforms the baseline feature recombination method in every case tested. This improved performance can also be further enhanced by using the recently introduced technology of deep neural nets (DNNs).

Cite

CITATION STYLE

APA

Kovács, G., Tóth, L., & Grósz, T. (2014). Robust multi-band asr using deep neural nets and spectro-temporal features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 386–393). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_48

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free