Design and Evaluation of an Ultra Low-power Human-quality Speech Recognition System

3Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

Automatic Speech Recognition (ASR) has experienced a dramatic evolution since pioneer development of Bell Lab's single-digit recognizer more than 50 years ago. Current ASR systems have taken advantage of the tremendous improvements in AI during the past decade by incorporating Deep Neural Networks into the system and pushing their accuracy to levels comparable to that of humans. This article describes and characterizes a representative ASR system with state-of-the-art accuracy and proposes a hardware platform capable of decoding speech in real-time with a power dissipation close to 1 Watt. The software is based on the so-called hybrid approach with a vocabulary of 200K words and RNN-based language model re-scoring, whereas the hardware consists of a commercially available low-power processor along with two accelerators used for the most compute-intensive tasks. The article shows that high performance can be obtained with very low power, enabling the deployment of these systems in extremely power-constrained environments such as mobile and IoT devices.

Cite

CITATION STYLE

APA

Pinto, D., Arnau, J. M., & González, A. (2020). Design and Evaluation of an Ultra Low-power Human-quality Speech Recognition System. ACM Transactions on Architecture and Code Optimization, 17(4). https://doi.org/10.1145/3425604

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free