A Fused Hidden Markov Model with Application to Bimodal Speech Processing

Hao Pan; Stephen E. Levinson; Thomas S. Huang; Zhi Pei Liang

Journal Article

IEEE Transactions on Signal Processing (2004) 52(3) 573-581

DOI: 10.1109/TSP.2003.822353

Abstract

This paper presents a novel fused hidden Markov model (fused HMM) for integrating tightly coupled time series, such as audio and visual features of speech. In this model, the time series are first modeled by two conventional HMMs separately. The resulting HMMs are then fused together using a probabilistic fusion model, which is optimal according to the maximum entropy principle and a maximum mutual information criterion. Simulations and bimodal speaker verification experiments show that the proposed model can significantly reduce the recognition errors in noiseless or noisy environments.
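The two-stage idea in the abstract (model each stream with its own HMM, then couple them probabilistically) can be illustrated with a small sketch. This is not the paper's formulation: it is a simplified, one-directional scoring rule that assumes discrete observations and approximates the joint likelihood as the likelihood of stream 1 under its HMM times the probability of stream 2 given the most likely hidden-state path of stream 1. All names (`forward_loglik`, `viterbi`, `fused_loglik`, `coupling`) and all numeric parameters are hypothetical and chosen only for illustration.

```python
import math

def forward_loglik(pi, A, B, obs):
    """Log-likelihood of a discrete sequence under one HMM (scaled forward pass)."""
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)              # rescale to avoid numerical underflow
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return loglik

def viterbi(pi, A, B, obs):
    """Most likely hidden-state path (assumes strictly positive parameters)."""
    n = len(pi)
    delta = [math.log(pi[i]) + math.log(B[i][obs[0]]) for i in range(n)]
    back = []
    for o in obs[1:]:
        prev = delta
        # Best predecessor for each state j at this time step.
        ptr = [max(range(n), key=lambda i: prev[i] + math.log(A[i][j]))
               for j in range(n)]
        delta = [prev[ptr[j]] + math.log(A[ptr[j]][j]) + math.log(B[j][o])
                 for j in range(n)]
        back.append(ptr)
    state = max(range(n), key=lambda i: delta[i])
    path = [state]
    for ptr in reversed(back):          # trace the back-pointers to the start
        state = ptr[state]
        path.append(state)
    return path[::-1]

def fused_loglik(hmm1, coupling, obs1, obs2):
    """Assumed fusion rule: log p(O1) + sum_t log p(o2_t | estimated state of stream 1)."""
    pi, A, B = hmm1
    path = viterbi(pi, A, B, obs1)
    return forward_loglik(pi, A, B, obs1) + sum(
        math.log(coupling[s][o]) for s, o in zip(path, obs2))

# Hypothetical two-state, two-symbol parameters for stream 1 (e.g. audio).
hmm1 = ([0.6, 0.4],
        [[0.7, 0.3], [0.4, 0.6]],
        [[0.9, 0.1], [0.2, 0.8]])
# Hypothetical coupling table: p(stream-2 symbol | stream-1 hidden state).
coupling = [[0.8, 0.2], [0.3, 0.7]]

audio = [0, 0, 1, 1]
video_consistent = [0, 0, 1, 1]
video_inconsistent = [1, 1, 0, 0]

score_consistent = fused_loglik(hmm1, coupling, audio, video_consistent)
score_inconsistent = fused_loglik(hmm1, coupling, audio, video_inconsistent)
```

Under these assumed parameters, a second stream that agrees with the state path inferred from the first stream scores higher than one that contradicts it, which is the kind of cross-modal evidence a fusion model is meant to exploit.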

Cite (APA)

Pan, H., Levinson, S. E., Huang, T. S., & Liang, Z. P. (2004). A Fused Hidden Markov Model with Application to Bimodal Speech Processing. IEEE Transactions on Signal Processing, 52(3), 573–581. https://doi.org/10.1109/TSP.2003.822353
