The automatic speech recognizor (ASR) based on hidden Markov models (HMMs) is very sensitive to multi-talker, non-stationary babble noise, which consists of a large number of speakers talking simultaneously. One major reason is due to mismatches between the training and testing conditions, which makes the accurate parameters of the HMM incapable of describing the uncertain distributions of the observations in speech signals. This paper applies one extension of the HMM referred to as the type-2 fuzzy hidden Markov models (T2 FHMMs) to modeling uncertain speech sequences. More specifically, we use the type2 fuzzy set (T2 FS) to describe uncertain parameters of the HMM that may vary anywhere in an interval with uniform possibilities. As a result, the likelihood of the T2 FHMM becomes an interval rather than a precise real number, which can be processed by the generalized linear model (GLM) for final classification decision-making. Experimental results of phoneme classification in the babble noise demonstrate a significant improvement compared with the HMM in terms of the robustness and classification rate. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Cao, X. Q., Zeng, J., & Yan, H. (2007). Modeling uncertain speech sequences using type-2 fuzzy hidden markov models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4810 LNCS, pp. 315–324). Springer Verlag. https://doi.org/10.1007/978-3-540-77255-2_34
Mendeley helps you to discover research relevant for your work.