While Hidden Markov Models (HMMs) have been the dominant technology in speech recognition for decades, deep neural networks (DNNs) have recently taken over. Current DNN technology requires frame-aligned training labels, which are usually created by first training an HMM system. It would clearly be desirable to train DNN-based recognizers without having to build an HMM for the same task. Here, we evaluate one such method, Connectionist Temporal Classification (CTC). Although it was originally proposed for training recurrent neural networks, we show that it can also be used to train conventional feed-forward networks. In the experimental part, we evaluate the method on standard phone recognition tasks. On all three databases we tested, the CTC method gave slightly better results than those obtained with training labels produced by forced alignment with an HMM system.
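CTC sidesteps frame alignment by summing the probability of every frame-level alignment that collapses to the target label sequence, using a forward recursion over the target extended with blank symbols. The following is a minimal pure-Python sketch of that forward pass (blank index, variable names, and the toy inputs are our own assumptions, not taken from the paper):

```python
BLANK = 0  # index of the CTC blank symbol (an assumption for this sketch)

def ctc_forward(probs, labels):
    """Compute P(labels | probs) by summing over all CTC alignments.

    probs  : T x V list of per-frame label posteriors (each row sums to 1)
    labels : target label sequence, without blanks
    """
    # Extend the target with blanks between and around every label.
    ext = [BLANK]
    for lab in labels:
        ext += [lab, BLANK]
    S, T = len(ext), len(probs)

    # alpha[s] = total probability of all alignments that are at ext[s]
    # after the current frame.
    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]
    for t in range(1, T):
        new = [0.0] * S
        for s in range(S):
            a = alpha[s]                      # stay on the same symbol
            if s >= 1:
                a += alpha[s - 1]             # advance by one position
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                a += alpha[s - 2]             # skip the blank between distinct labels
            new[s] = a * probs[t][ext[s]]
        alpha = new
    # Valid alignments end on the last label or on the trailing blank.
    return alpha[S - 1] + (alpha[S - 2] if S > 1 else 0.0)

# Toy example: 2 frames, vocabulary {blank, 1}.
frames = [[0.4, 0.6], [0.4, 0.6]]
print(ctc_forward(frames, [1]))   # P of emitting the sequence [1]
```

For the toy input above, the three alignments (blank,1), (1,blank), and (1,1) all collapse to [1], so their probabilities are summed; training with CTC backpropagates the gradient of -log of this quantity, with no per-frame labels needed.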
Grósz, T., Gosztolya, G., & Tóth, L. (2014). A sequence training method for deep rectifier neural networks in speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 81–88). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_10