State-of-the-art and recent progress in hybrid HMM/ANN speech recognition

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

It is by now very well known that for pattern classification problems Artificial Neural Networks (ANNs) can provide (good) estimates of posterior probabilities of output classes conditioned on the input. As a result of this property, a number of researchers over the last seven years have built systems, referred to as hybrid HMM/ANN systems, in which ANNs have been discriminativeiy trained to estimate emission probabilities for hidden Markov models (HMMs). Such systems have been proved, on controlled tests, to be both effective in terms of accuracy and efficient in terms of CPU and memory run-time requirements. After a short introduction to these HMM/ANN systems, a few of the extensions currently under investigation will be discussed. While initial HMM/ANN systems were discriminant only locally (at the HMM state level), the underlying theory will be briefly revisited to allow ANNs to be trained in a (unsupervised) globally discriminant manner (at the sentence level); this will also provide us with new insights into the general HMM/ANN approach. Secondly, a particular new speech recognition approach, referred to as multi-band or multi-stream speech recognition, which could directly benefit from hybrid HMM/ANN developments will be discussed. In this latter approach, the speech signal is decomposed into frequency sub-bands which are independently processed by different HMM/ANN systems (the ANN allowing to introduce more temporal information to compensate for the lack of frequency information) that are recombined in a linear or nonlinear (ANN) way at some temporal anchor points. Some of the very promising results achieved so far by this approach will be presented.

Cite

CITATION STYLE

APA

Bourlard, H. (1997). State-of-the-art and recent progress in hybrid HMM/ANN speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1327, pp. 876–884). Springer Verlag. https://doi.org/10.1007/bfb0020264

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free