Robust multi-band asr using deep neural nets and spectro-temporal features

György Kovács; László Tóth; Tamás Grósz

Conference Proceedings

Robust multi-band asr using deep neural nets and spectro-temporal features

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8773 386-393

DOI: 10.1007/978-3-319-11581-8_48

2Citations

5Readers

Get full text

Abstract

Spectro-temporal feature extraction and multi-band processing were both designed to make the speech recognizers more robust. Although they have been used for a long time now, very few attempts have been made to combine them. This is why here we integrate two spectro-temporal feature extraction methods into a multi-band framework. We assess the performance of our spectro-temporal feature sets both individually (as a baseline) and in combination with multi-band processing in phone recognition tasks on clean and noise contaminated versions of the TIMIT dataset. Our results show that multi-band processing clearly outperforms the baseline feature recombination method in every case tested. This improved performance can also be further enhanced by using the recently introduced technology of deep neural nets (DNNs).

Author supplied keywords

Cite

CITATION STYLE

APA

Kovács, G., Tóth, L., & Grósz, T. (2014). Robust multi-band asr using deep neural nets and spectro-temporal features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 386–393). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_48

Robust multi-band asr using deep neural nets and spectro-temporal features

Abstract

Author supplied keywords

Cite

Register to see more suggestions