Design of speech recognition engine

Luděk Müller; Josef Psutka; Luboš Šmídl

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2000) 1902 259-264

DOI: 10.1007/3-540-45323-7_44

14 Citations

4 Readers

Abstract

This paper concerns a speaker-independent recognition engine for continuous Czech speech, designed for Czech telephone applications, and describes the recognition module as an important component of a telephone dialogue system being designed and constructed at the Department of Cybernetics, University of West Bohemia. Recognition is based on a statistical approach. Left-to-right three-state HMMs, with output probability density functions expressed as multivariate Gaussian mixtures, are used to model triphones as the basic units of acoustic modelling, and stochastic regular grammars are implemented to reduce task perplexity. Real-time recognition is supported by an approach that reduces the computational cost of estimating the log-likelihood scores of Gaussian mixtures, and by beam pruning during Viterbi decoding. The present paper concerns the main part of the engine: a speaker-independent recognition engine for continuous Czech speech.
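To make the decoding terms in the abstract concrete, the following is a minimal sketch of Viterbi decoding with beam pruning over a single left-to-right three-state HMM whose states emit from diagonal-covariance Gaussian mixtures. It is not the authors' implementation: the function names (`log_gmm`, `viterbi_beam`), the diagonal-covariance assumption, the shared transition log-probabilities, and the toy data are all assumptions made for illustration. The mixture log-likelihood is computed exactly here, so the cost-reduction technique the abstract mentions is not reproduced.

```python
import math

NEG_INF = float("-inf")


def log_gmm(x, weights, means, variances):
    """Exact log-likelihood of vector x under a diagonal-covariance GMM."""
    comp = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, m, v in zip(x, mu, var):
            ll += -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        comp.append(ll)
    top = max(comp)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(c - top) for c in comp))


def viterbi_beam(obs, states, log_self, log_next, beam):
    """Viterbi over a left-to-right chain: each state may loop or advance by one.

    states: one (weights, means, variances) tuple per HMM state.
    beam:   predecessors scoring more than `beam` below the best are pruned.
    Returns (log score of the best path ending in the last state, state path),
    or (-inf, None) if pruning removed every such path.
    """
    n = len(states)
    score = [NEG_INF] * n
    score[0] = log_gmm(obs[0], *states[0])  # decoding must start in state 0
    back = []
    for x in obs[1:]:
        threshold = max(score) - beam
        new = [NEG_INF] * n
        ptr = [-1] * n
        for j in range(n):
            cands = [(score[j] + log_self, j)]           # self-loop
            if j > 0:
                cands.append((score[j - 1] + log_next, j - 1))  # advance
            # Beam pruning: keep only predecessors inside the beam.
            cands = [(s, p) for s, p in cands if score[p] >= threshold]
            cands = [(s, p) for s, p in cands if s > NEG_INF]
            if cands:
                s, p = max(cands)
                new[j] = s + log_gmm(x, *states[j])
                ptr[j] = p
        score = new
        back.append(ptr)
    if score[n - 1] == NEG_INF:
        return NEG_INF, None
    path = [n - 1]
    for ptr in reversed(back):  # follow back-pointers from the final state
        path.append(ptr[path[-1]])
    path.reverse()
    return score[n - 1], path


# Toy model (hypothetical): 1-D features, one Gaussian per state.
toy_states = [
    ([1.0], [[0.0]], [[1.0]]),
    ([1.0], [[5.0]], [[1.0]]),
    ([1.0], [[10.0]], [[1.0]]),
]
toy_obs = [[0.1], [0.0], [5.2], [4.9], [10.1], [9.8]]
half = math.log(0.5)
best_score, best_path = viterbi_beam(toy_obs, toy_states, half, half, beam=5.0)
```

In this sketch a narrower `beam` discards more hypotheses per frame and so does less work, at the risk of pruning the path that would eventually have won; a full recognizer would apply the same idea across a network of triphone models constrained by the grammar.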

Cite

Müller, L., Psutka, J., & Šmídl, L. (2000). Design of speech recognition engine. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1902, pp. 259–264). Springer Verlag. https://doi.org/10.1007/3-540-45323-7_44