Abstract
A hybrid approach using Long Short Term Memory (LSTM) Recurrent Neural Network (RNN) has showed great improvement in speech recognition accuracy. For training acoustic model based on hybrid approach, it requires forced alignment of HMM state sequence from Gaussian Mixture Model (GMM)-Hidden Markov Model (HMM). However, high computation time for training GMM-HMM is required. This paper proposes an end-to-end approach for LSTM RNN-based Korean speech recognition to improve learning speed. A Connectionist Temporal Classification (CTC) algorithm is proposed to implement this approach. The proposed method showed almost equal performance in recognition rate, while the learning speed is 1.27 times faster.
Cite
CITATION STYLE
Lee, D., Lim, M., Park, H., & Kim, J.-H. (2017). LSTM RNN-based Korean Speech Recognition System Using CTC. Journal of Digital Contents Society, 18(1), 93–99. https://doi.org/10.9728/dcs.2017.18.1.93
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.