Entropy-Based Dynamic Rescoring with Language Model in E2E ASR Systems

0Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Language models (LM) have played crucial roles in automatic speech recognition (ASR), whether as an essential part of a conventional ASR system composed of an acoustic model and LM, or as an integrated model to enhance the performance of novel end-to-end ASR systems. With the development of machine learning and deep learning, language modeling has made great progress in natural language processing applications. In recent years, efforts have been made to leverage the advantages of novel LM to ASR. The most common way to apply an integration is still shallow fusion because it can be easily implemented by zero-overhead while obtaining significant improvement. Our method can further enhance the applicability of shallow fusion without hyperparameter tuning while maintaining similar performance.

Cite

CITATION STYLE

APA

Gong, Z., Saito, D., & Minematsu, N. (2022). Entropy-Based Dynamic Rescoring with Language Model in E2E ASR Systems. Applied Sciences (Switzerland), 12(19). https://doi.org/10.3390/app12199690

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free