In this paper, the opening work on the development of a Lithuanian HMM speech recognition system is described. The triphone single-Gaussian HMM speech recognition system based on Mel Frequency Cepstral Coefficients (MFCC) was developed using HTK toolkit. Hidden Markov model's parameters were estimated from phone-level hand-annotated Lithuanian speech corpus. The system was evaluated on a speaker-independent ∼ 750 distinct isolated-word recognition task. Though the speaker adaptation and language modeling techniques were not used, the system was performing at 20% word error rate.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Raškinis, G., & Raškiniene, D. (2003). Building Medium-Vocabulary Isolated-Word Lithuanian HMM Speech Recognition System. Informatica, 14(1), 75–84. https://doi.org/10.15388/informatica.2003.005