Visual-speech Synthesis of Exaggerated Corrective Feedback

Yaohua Bu; Weijun Li; Tianyi Ma; Shengqi Chen; Jia Jia; Kun Li; Xiaobo Lu

Conference ProceedingsOPEN ACCESS

Visual-speech Synthesis of Exaggerated Corrective Feedback

Bu Y
Li W
Ma T
et al.

MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (2020) 4521-4523

DOI: 10.1145/3394171.3414444

1Citations

8Readers

Get full text

Abstract

To provide more discriminative feedback for the second language (L2) learners to better identify their mispronunciation, we propose a method for exaggerated visual-speech feedback in computer-assisted pronunciation training (CAPT). The speech exaggeration is realized by an emphatic speech generation neural network based on Tacotron, while the visual exaggeration is accomplished by ADC Viseme Blending, namely increasing Amplitude of movement, extending the phone's Duration and enhancing the color Contrast. User studies show that exaggerated feedback outperforms non-exaggerated version on helping learners with pronunciation identification and pronunciation improvement.

Author supplied keywords

Cite

CITATION STYLE

APA

Bu, Y., Li, W., Ma, T., Chen, S., Jia, J., Li, K., & Lu, X. (2020). Visual-speech Synthesis of Exaggerated Corrective Feedback. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 4521–4523). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3414444

Visual-speech Synthesis of Exaggerated Corrective Feedback

Abstract

Author supplied keywords

Cite

Register to see more suggestions