Audio-visual isolated words recognition for voice dialogue system

Josef Chaloupka

Conference Proceedings

Audio-visual isolated words recognition for voice dialogue system

Chaloupka J

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6800 LNCS 88-94

DOI: 10.1007/978-3-642-25775-9_8

1Citations

5Readers

Get full text

Abstract

This contribution is about experiments in audio-visual isolated words recognition. The results of these experiments will be used to improve our voice dialogue system, where visual speech recognition will be added. The voice dialogue systems can be used in train or bus stations (or elsewhere), where noise levels are relatively high, therefore the visual part of speech can improve the recognition rate mainly in noisy conditions. The audio-visual recognition of isolated words in our experiments was based on the technique of two-stream Hidden Markov Models (HMM) and on the HMM of single Czech phonemes and visemes. Different visual speech features and a different number of states and mixtures of HMM were evaluated in single tests. In the following experiments, isolated words were being recognized after training of the HMM and babble noise was added in the successive steps to the acoustic speech signal. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Chaloupka, J. (2011). Audio-visual isolated words recognition for voice dialogue system. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6800 LNCS, pp. 88–94). https://doi.org/10.1007/978-3-642-25775-9_8

Audio-visual isolated words recognition for voice dialogue system

Abstract

Author supplied keywords

Cite

Register to see more suggestions