MKD: Mixup-Based Knowledge Distillation for Mandarin End-to-End Speech Recognition

Xing Wu; Yifan Jin; Jianjia Wang; Quan Qian; Yike Guo

Journal Article, Open Access

Algorithms (2022) 15(5)

DOI: 10.3390/a15050160

Abstract

Large-scale automatic speech recognition (ASR) models have achieved impressive performance. However, training an ASR model requires huge computational resources and massive amounts of data. Knowledge distillation is a prevalent model compression method that transfers knowledge from a large model to a small model. To improve the efficiency of knowledge distillation for end-to-end speech recognition, especially in the low-resource setting, a Mixup-based Knowledge Distillation (MKD) method is proposed that combines Mixup, a data-agnostic data augmentation method, with softmax-level knowledge distillation. A loss-level mixture is presented to address the problem caused by the non-linearity of the label in the KL-divergence when applying Mixup to the teacher–student framework. It is shown mathematically that optimizing the mixture of loss functions is equivalent to optimizing an upper bound of the original knowledge distillation loss. The proposed MKD takes advantage of Mixup and brings robustness to the model even with a small amount of training data. Experiments on Aishell-1 show that MKD obtains 15.6% and 3.3% relative improvements on two student models with different parameter scales compared with existing methods. Experiments on data efficiency demonstrate that MKD achieves similar results with only half of the original dataset.
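
The abstract does not give the exact form of the loss, so the following is only an illustrative sketch of how a loss-level mixture could relate to a label-level one. It assumes the loss-level mixture is the lambda-weighted sum of two KL terms, one per teacher distribution, and it relies on the standard fact that KL-divergence is convex in its first argument. All function names, the temperature parameter, and the Beta(alpha, alpha) sampling of lambda are assumptions made for illustration, not details taken from the paper.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def mixup(x_a, x_b, lam):
    """Convex combination of two equal-length vectors (features or labels)."""
    return [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]


def label_level_mixup_kd(teacher_a, teacher_b, student_mixed, lam):
    """KL between the mixed teacher distribution and the student output."""
    return kl_divergence(mixup(teacher_a, teacher_b, lam), student_mixed)


def loss_level_mixup_kd(teacher_a, teacher_b, student_mixed, lam):
    """Assumed loss-level mixture: the two KL terms are mixed, not the labels."""
    return (lam * kl_divergence(teacher_a, student_mixed)
            + (1.0 - lam) * kl_divergence(teacher_b, student_mixed))


if __name__ == "__main__":
    random.seed(0)
    alpha = 0.5                             # hypothetical Mixup hyperparameter
    lam = random.betavariate(alpha, alpha)  # mixing coefficient in [0, 1]

    # Hypothetical teacher outputs for two utterance frames, and the student
    # output for the mixed input; the logits are made up for the demo.
    teacher_a = softmax([2.0, 0.5, -1.0])
    teacher_b = softmax([-0.5, 1.5, 0.3])
    student_mixed = softmax([0.1, 0.2, 0.3])

    label_loss = label_level_mixup_kd(teacher_a, teacher_b, student_mixed, lam)
    loss_loss = loss_level_mixup_kd(teacher_a, teacher_b, student_mixed, lam)
    print(f"lambda={lam:.3f} label-level={label_loss:.4f} loss-level={loss_loss:.4f}")
```

Under these assumptions the loss-level value is never smaller than the label-level value for any lambda in [0, 1], which is one way to read the abstract's statement that the mixed loss acts as an upper bound. The paper's own derivation may differ in detail.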

Cite

Wu, X., Jin, Y., Wang, J., Qian, Q., & Guo, Y. (2022). MKD: Mixup-Based Knowledge Distillation for Mandarin End-to-End Speech Recognition. Algorithms, 15(5). https://doi.org/10.3390/a15050160
