Speech silicon: An FPGA architecture for real-time Hidden Markov-model-based speech recognition

Jeffrey Schuster; Kshitij Gupta; Raymond Hoare; Alex K. Jones

Journal ArticleOPEN ACCESS

Speech silicon: An FPGA architecture for real-time Hidden Markov-model-based speech recognition

Eurasip Journal on Embedded Systems (2006) 2006

DOI: 10.1155/ES/2006/48085

13Citations

20Readers

Abstract

This paper examines the design of an FPGA-based system-on-a-chip capable of performing continuous speech recognition on medium sized vocabularies in real time. Through the creation of three dedicated pipelines, one for each of the major operations in the system, we were able to maximize the throughput of the system while simultaneously minimizing the number of pipeline stalls in the system. Further, by implementing a token-passing scheme between the later stages of the system, the complexity of the control was greatly reduced and the amount of active data present in the system at any time was minimized. Additionally, through in-depth analysis of the SPHINX 3 large vocabulary continuous speech recognition engine, we were able to design models that could be efficiently benchmarked against a known software platform. These results, combined with the ability to reprogram the system for different recognition tasks, serve to create a system capable of performing real-time speech recognition in a vast array of environments.

Cite

CITATION STYLE

APA

Schuster, J., Gupta, K., Hoare, R., & Jones, A. K. (2006). Speech silicon: An FPGA architecture for real-time Hidden Markov-model-based speech recognition. Eurasip Journal on Embedded Systems, 2006. https://doi.org/10.1155/ES/2006/48085

Speech silicon: An FPGA architecture for real-time Hidden Markov-model-based speech recognition

Abstract

Cite

Register to see more suggestions