Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition

Damheo Lee; Donghyun Kim; Seung Yun; Sanghun Kim

Journal ArticleOPEN ACCESS

Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition

Applied Sciences (Switzerland) (2021) 11(6)

DOI: 10.3390/app11062866

3Citations

11Readers

Abstract

In this paper, we propose a new method for code-switching (CS) automatic speech recognition (ASR) in Korean. First, the phonetic variations in English pronunciation spoken by Korean speakers should be considered. Thus, we tried to find a unified pronunciation model based on phonetic knowledge and deep learning. Second, we extracted the CS sentences semantically similar to the target domain and then applied the language model (LM) adaptation to solve the biased modeling toward Korean due to the imbalanced training data. In this experiment, training data were AI Hub (1033 h) in Korean and Librispeech (960 h) in English. As a result, when compared to the baseline, the proposed method improved the error reduction rate (ERR) by up to 11.6% with phonetic variant modeling and by 17.3% when semantically similar sentences were applied to the LM adaptation. If we considered only English words, the word correction rate improved up to 24.2% compared to that of the baseline. The proposed method seems to be very effective in CS speech recognition.

Author supplied keywords

Cite

CITATION STYLE

APA

Lee, D., Kim, D., Yun, S., & Kim, S. (2021). Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition. Applied Sciences (Switzerland), 11(6). https://doi.org/10.3390/app11062866

Phonetic variation modeling and a language model adaptation for korean english code-switching speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions