Isolated word speech recognition system using deep neural networks

16Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Speech recognition is the process of converting speech signals into words. For acoustic modeling HMM-GMM is used for many years. For GMM, it requires assumptions near the data distribution for calculating probabilities. For removing this limitation, GMM is replaced by DNN in acoustic model. Deep neural networks are the feed forward neural networks having more than one or multiple layers of hidden units. In this work, we have presented the isolated word speech recognition system using acoustic model of HMM and DNN. We are using Deep Belief Network pre-training algorithm for initializing deep neural networks. DBN is a multilayer generative probabilistic model with large number of stochastic binary units. The features used are the mel-frequency cepstrum coefficients (MFCC). Experimental results are calculated on TI digits database. Proposed system has achieved 86.06% accuracy on TI digits database. System accuracy can be further increased by increasing the number of hidden units.

Cite

CITATION STYLE

APA

Dhanashri, D., & Dhonde, S. B. (2017). Isolated word speech recognition system using deep neural networks. In Advances in Intelligent Systems and Computing (Vol. 468, pp. 9–17). Springer Verlag. https://doi.org/10.1007/978-981-10-1675-2_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free