Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition

3Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we propose a new method for code-switching (CS) automatic speech recognition (ASR) in Korean. First, the phonetic variations in English pronunciation spoken by Korean speakers should be considered. Thus, we tried to find a unified pronunciation model based on phonetic knowledge and deep learning. Second, we extracted the CS sentences semantically similar to the target domain and then applied the language model (LM) adaptation to solve the biased modeling toward Korean due to the imbalanced training data. In this experiment, training data were AI Hub (1033 h) in Korean and Librispeech (960 h) in English. As a result, when compared to the baseline, the proposed method improved the error reduction rate (ERR) by up to 11.6% with phonetic variant modeling and by 17.3% when semantically similar sentences were applied to the LM adaptation. If we considered only English words, the word correction rate improved up to 24.2% compared to that of the baseline. The proposed method seems to be very effective in CS speech recognition.

Cite

CITATION STYLE

APA

Lee, D., Kim, D., Yun, S., & Kim, S. (2021). Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition. Applied Sciences (Switzerland), 11(6). https://doi.org/10.3390/app11062866

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free