LSTM RNN-based Korean Speech Recognition System Using CTC

Donghyun Lee; Minkyu Lim; Hosung Park; Ji-Hwan Kim

Journal ArticleOPEN ACCESS

LSTM RNN-based Korean Speech Recognition System Using CTC

Lee D
Lim M
Park H
et al.

Journal of Digital Contents Society (2017) 18(1) 93-99

DOI: 10.9728/dcs.2017.18.1.93

N/ACitations

9Readers

Abstract

A hybrid approach using Long Short Term Memory (LSTM) Recurrent Neural Network (RNN) has showed great improvement in speech recognition accuracy. For training acoustic model based on hybrid approach, it requires forced alignment of HMM state sequence from Gaussian Mixture Model (GMM)-Hidden Markov Model (HMM). However, high computation time for training GMM-HMM is required. This paper proposes an end-to-end approach for LSTM RNN-based Korean speech recognition to improve learning speed. A Connectionist Temporal Classification (CTC) algorithm is proposed to implement this approach. The proposed method showed almost equal performance in recognition rate, while the learning speed is 1.27 times faster.

Cite

CITATION STYLE

APA

Lee, D., Lim, M., Park, H., & Kim, J.-H. (2017). LSTM RNN-based Korean Speech Recognition System Using CTC. Journal of Digital Contents Society, 18(1), 93–99. https://doi.org/10.9728/dcs.2017.18.1.93

LSTM RNN-based Korean Speech Recognition System Using CTC

Abstract

Cite

Register to see more suggestions