This paper presents efficient frame-synchronous beam pruning for HMM-based automatic speech recognition. In the conventional beam pruning, a few hypotheses that have greater potential to reach various words on a lexical tree are likely to be pruned out by a number of hypotheses that have limited potential, since all hypotheses are treated equally without considering this potential. To make the beam pruning less restrictive for hypotheses with greater potential and vice versa, the proposed method adds to the likelihood of each hypothesis a tentative reward as a monotonically increasing function of the number of reachable words from the HMM state where the hypothesis stays in a lexical tree. The reward is designed not to collapse the ASR probabilistic framework. The proposed method reduced 84% of the processing time for a grammar-based 10k-word short sentence recognition task. For a language-model-based dictation task, it also resulted in an additional 23% reduction in processing time from the beam pruning with the language model look-ahead technique. Copyright © 2011 The Institute of Electronics, Information and Communication Engineers.
CITATION STYLE
Kato, T., Fujita, K., & Nishizawa, N. (2011). Efficient beam pruning for speech recognition with a reward considering the potential to reach various words on a lexical tree. IEICE Transactions on Information and Systems, E94-D(6), 1253–1259. https://doi.org/10.1587/transinf.E94.D.1253
Mendeley helps you to discover research relevant for your work.