Phoneme Set design based on integrated acoustic and linguistic features for second language speech recognition

Xiaoyun Wang; Tsuneo Kato; Seiichi Yamamoto

Journal Article (Open Access)

IEICE Transactions on Information and Systems (2017) E100D(4) 857-864

DOI: 10.1587/transinf.2016EDP7207

3 Citations

14 Readers

Abstract

Recognition of second language (L2) speech is a challenging task even for state-of-the-art automatic speech recognition (ASR) systems, partly because the pronunciation of L2 speakers is usually strongly influenced by their mother tongue. Considering that the expressions of non-native speakers are usually simpler than those of native speakers, and that second language speech usually includes mispronunciations and less fluent pronunciation, we propose a novel method that maximizes a unified acoustic and linguistic objective function to derive a phoneme set for second language speech recognition. We verify the efficacy of the proposed method using second language speech collected with a translation-game-type, dialogue-based computer-assisted language learning (CALL) system. In this paper, we examine performance based on acoustic likelihood, linguistic discrimination ability, and the integrated objective function for second language speech. Experiments demonstrate the validity of the phoneme set derived by the proposed method.

Cite

Wang, X., Kato, T., & Yamamoto, S. (2017). Phoneme Set design based on integrated acoustic and linguistic features for second language speech recognition. IEICE Transactions on Information and Systems, E100D(4), 857–864. https://doi.org/10.1587/transinf.2016EDP7207
