Efficient beam pruning for speech recognition with a reward considering the potential to reach various words on a lexical tree

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

This paper presents efficient frame-synchronous beam pruning for HMM-based automatic speech recognition. In the conventional beam pruning, a few hypotheses that have greater potential to reach various words on a lexical tree are likely to be pruned out by a number of hypotheses that have limited potential, since all hypotheses are treated equally without considering this potential. To make the beam pruning less restrictive for hypotheses with greater potential and vice versa, the proposed method adds to the likelihood of each hypothesis a tentative reward as a monotonically increasing function of the number of reachable words from the HMM state where the hypothesis stays in a lexical tree. The reward is designed not to collapse the ASR probabilistic framework. The proposed method reduced 84% of the processing time for a grammar-based 10k-word short sentence recognition task. For a language-model-based dictation task, it also resulted in an additional 23% reduction in processing time from the beam pruning with the language model look-ahead technique. Copyright © 2011 The Institute of Electronics, Information and Communication Engineers.

Cite

CITATION STYLE

APA

Kato, T., Fujita, K., & Nishizawa, N. (2011). Efficient beam pruning for speech recognition with a reward considering the potential to reach various words on a lexical tree. IEICE Transactions on Information and Systems, E94-D(6), 1253–1259. https://doi.org/10.1587/transinf.E94.D.1253

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free