EdgeCRNN: an edge-computing oriented model of acoustic feature enhancement for keyword spotting

Yungen Wei; Zheng Gong; Shunzhi Yang; Kai Ye; Yamin Wen

Journal ArticleOPEN ACCESS

EdgeCRNN: an edge-computing oriented model of acoustic feature enhancement for keyword spotting

Journal of Ambient Intelligence and Humanized Computing (2022) 13(3) 1525-1535

DOI: 10.1007/s12652-021-03022-1

12Citations

17Readers

Get full text

Abstract

Keyword Spotting (KWS) is a significant branch of Automatic Speech Recognition (ASR) and has been widely used in edge computing devices. The goal of KWS is to provide high accuracy with a low False Alarm Rate (FAR), while reducing the costs of memory, computation, and latency. However, limited resources are challenging for KWS applications on edge computing devices. Lightweight models and structures for deep learning have achieved good results in the KWS branch while maintaining efficient performances. In this paper, we present a new Convolutional Recurrent Neural Network (CRNN) architecture named EdgeCRNN for edge computing devices. EdgeCRNN, which is based on depthwise separable convolution and residual structure, uses a feature enhanced method. On the Google Speech Commands Dataset, the experimental results depict that EdgeCRNN can test 11.1 audio data per second on Raspberry Pi 3B+, which is 2.2 times than that of Tpool2. Compared with Tpool2, the accuracy of EdgeCRNN reaches 98.05% whilst its performance is also competitive.

Author supplied keywords

Cite

CITATION STYLE

APA

Wei, Y., Gong, Z., Yang, S., Ye, K., & Wen, Y. (2022). EdgeCRNN: an edge-computing oriented model of acoustic feature enhancement for keyword spotting. Journal of Ambient Intelligence and Humanized Computing, 13(3), 1525–1535. https://doi.org/10.1007/s12652-021-03022-1

EdgeCRNN: an edge-computing oriented model of acoustic feature enhancement for keyword spotting

Abstract

Author supplied keywords

Cite

Register to see more suggestions