A support vector machine-based dynamic network for visual speech recognition applications

Mihaela Gordan; Constantine Kotropoulos; Ioannis Pitas

Journal Article, Open Access

Eurasip Journal on Applied Signal Processing (2002) 2002(11) 1248-1259

DOI: 10.1155/S1110865702207039

Abstract

Visual speech recognition is an emerging research field. In this paper, we examine the suitability of support vector machines for visual speech recognition. Each word is modeled as a temporal sequence of visemes corresponding to the different phones realized. One support vector machine is trained to recognize each viseme and its output is converted to a posterior probability through a sigmoidal mapping. To model the temporal character of speech, the support vector machines are integrated as nodes into a Viterbi lattice. We test the performance of the proposed approach on a small visual speech recognition task, namely the recognition of the first four digits in English. The word recognition rate obtained is at the level of the previous best reported rates.
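The pipeline described in the abstract (per-viseme SVM outputs, a sigmoidal mapping to posteriors, and a Viterbi lattice over the viseme sequence of each word) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the sigmoid parameters or the lattice topology, so the sketch assumes a sigmoid of the form 1 / (1 + exp(a*f + b)) with placeholder values for `a` and `b`, a left-to-right lattice in which each frame either stays in the current viseme or advances to the next, and no transition probabilities. The function names and the viseme labels "A", "B", "C" are hypothetical.

```python
import math


def sigmoid_posterior(f, a=-1.0, b=0.0):
    """Map a raw SVM decision value f into (0, 1).

    Assumed form: 1 / (1 + exp(a*f + b)); a and b are placeholders that
    would normally be fitted on held-out data.
    """
    z = max(-60.0, min(60.0, a * f + b))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(z))


def viterbi_score(frame_scores, visemes):
    """Best log-score of aligning frames to a left-to-right viseme sequence.

    frame_scores: one dict per frame, viseme label -> SVM decision value.
    visemes: ordered viseme labels forming one word model.
    Returns (log_score, best viseme label per frame). The score is -inf
    when there are fewer frames than visemes; the path is then meaningless.
    """
    n = len(visemes)
    neg_inf = float("-inf")

    def logp(t, s):
        return math.log(sigmoid_posterior(frame_scores[t][visemes[s]]))

    best = [neg_inf] * n
    best[0] = logp(0, 0)  # every path starts in the first viseme
    back = []
    for t in range(1, len(frame_scores)):
        new = [neg_inf] * n
        ptr = [0] * n
        for s in range(n):
            stay = best[s]
            advance = best[s - 1] if s > 0 else neg_inf
            if advance > stay:
                prev, ptr[s] = advance, s - 1
            else:
                prev, ptr[s] = stay, s
            if prev > neg_inf:
                new[s] = prev + logp(t, s)
        best = new
        back.append(ptr)

    # Every path must end in the last viseme; trace the pointers back.
    s = n - 1
    path = [s]
    for ptr in reversed(back):
        s = ptr[s]
        path.append(s)
    path.reverse()
    return best[n - 1], [visemes[i] for i in path]


def recognize(frame_scores, word_models):
    """Return the word whose viseme lattice gives the highest Viterbi score."""
    return max(word_models,
               key=lambda w: viterbi_score(frame_scores, word_models[w])[0])


# Toy data: three frames, decision values from three hypothetical viseme SVMs.
frames = [
    {"A": 2.0, "B": -1.0, "C": -2.0},
    {"A": 1.0, "B": 0.5, "C": -1.5},
    {"A": -1.0, "B": 2.0, "C": -1.0},
]
models = {"one": ["A", "B"], "two": ["C", "B"]}
print(recognize(frames, models))
```

Working in log-probabilities keeps the product of per-frame posteriors numerically stable, and restricting transitions to "stay" or "advance" is one simple way to encode the temporal order of visemes within a word.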

Citation (APA)

Gordan, M., Kotropoulos, C., & Pitas, I. (2002). A support vector machine-based dynamic network for visual speech recognition applications. Eurasip Journal on Applied Signal Processing, 2002(11), 1248–1259. https://doi.org/10.1155/S1110865702207039
