LSTM RNN-based Korean Speech Recognition System Using CTC

  • Lee D
  • Lim M
  • Park H
  • et al.
N/ACitations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

A hybrid approach using Long Short Term Memory (LSTM) Recurrent Neural Network (RNN) has showed great improvement in speech recognition accuracy. For training acoustic model based on hybrid approach, it requires forced alignment of HMM state sequence from Gaussian Mixture Model (GMM)-Hidden Markov Model (HMM). However, high computation time for training GMM-HMM is required. This paper proposes an end-to-end approach for LSTM RNN-based Korean speech recognition to improve learning speed. A Connectionist Temporal Classification (CTC) algorithm is proposed to implement this approach. The proposed method showed almost equal performance in recognition rate, while the learning speed is 1.27 times faster.

Cite

CITATION STYLE

APA

Lee, D., Lim, M., Park, H., & Kim, J.-H. (2017). LSTM RNN-based Korean Speech Recognition System Using CTC. Journal of Digital Contents Society, 18(1), 93–99. https://doi.org/10.9728/dcs.2017.18.1.93

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free