Abstract
Minimum Phone Error (MPE) is an objective function for discriminative training of acoustic models for speech recognition. Recently several different objective functions related to MPE have been proposed. In this paper we compare implementations of three of these to MPE on English and Arabic broadcast news. The techniques investigated are Minimum Phone Frame Error (MPFE), Minimum Divergence (MD), and a physical-state level version of Minimum Bayes Risk which we call s-MBR. In the case of MPFE we observe improvements over MPE. We propose that the smoothing constant used in MPE should be scaled according to the average value of the counts in the statistics obtained from these objective functions. © 2007 IEEE.
Author supplied keywords
Cite
CITATION STYLE
Povey, D., & Kingsbury, B. (2007). Evaluation of proposed modifications to MPE for large scale discriminative training. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 4). https://doi.org/10.1109/ICASSP.2007.366914
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.