An Improved DNN-based Approach to Mispronunciation Detection and Diagnosis of L2 Learners’ Speech

Wenping Hu; Yao Qian; Frank K. Soong

Conference Proceedings

An Improved DNN-based Approach to Mispronunciation Detection and Diagnosis of L2 Learners’ Speech

Speech and Language Technology in Education, SLaTE 2015 (2015) 71-76

DOI: 10.21437/slate.2015-13

49Citations

29Readers

Get full text

Abstract

We extend the Goodness of Pronunciation (GOP) algorithm from the conventional GMM-HMM to DNN-HMM and further optimize the GOP measure toward L2 language learners’ accented speech. We evaluate the performance of the new proposed approach at phone-level mispronunciation detection and diagnosis on an L2 English learners’ corpus. Experimental results show that the Equal Error Rate (EER) is improved from 32.9% to 27.0% by extending GOP from GMM-HMM to DNN-HMM and the EER can be further improved by another 1.5% to 25.5% with our optimized GOP measure. For phone mispronunciation diagnosis, by applying our optimized DNN based GOP measure, the top-1 error rate is reduced from 61.0% to 51.4%, compared with the original DNN based one, and the top-5 error rate is reduced from 8.4% to 5.2%. On a continuously read, L2 Mandarin learners’ corpus, our approaches also achieve similar improvements.

Author supplied keywords

Cite

CITATION STYLE

APA

Hu, W., Qian, Y., & Soong, F. K. (2015). An Improved DNN-based Approach to Mispronunciation Detection and Diagnosis of L2 Learners’ Speech. In Speech and Language Technology in Education, SLaTE 2015 (pp. 71–76). The International Society for Computers and Their Applications (ISCA). https://doi.org/10.21437/slate.2015-13

An Improved DNN-based Approach to Mispronunciation Detection and Diagnosis of L2 Learners’ Speech

Abstract

Author supplied keywords

Cite

Register to see more suggestions