On creating multimodal virtual humans – real time speech driven facial gesturing

14 citations · 31 Mendeley readers

Abstract

Because of the extensive use of diverse computing devices, human-computer interaction design is moving toward user-centric interfaces. This involves incorporating the different modalities that humans use in everyday communication. Virtual humans that look and behave believably fit perfectly into this concept of designing interfaces in a more natural, effective, and socially oriented way. In this paper we present a novel method for automatic speech-driven facial gesturing for virtual humans, capable of real-time performance. The facial gestures included are various nods and head movements, blinks, eyebrow gestures, and gaze. The mapping from speech to facial gestures is based on prosodic information obtained from the speech signal and is realized using a hybrid approach combining Hidden Markov Models, rules, and global statistics. Further, we test the method using an application prototype: a system for speech-driven facial gesturing suitable for virtual presenters. A subjective evaluation of the system confirmed that the synthesized facial movements are consistent and time-aligned with the underlying speech, and thus provide natural behavior of the whole face. © 2010 Springer Science+Business Media, LLC.
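The abstract does not include an implementation, but the hybrid mapping it describes can be sketched: quantize prosodic features (such as pitch movement and energy) into a symbol stream, decode the most likely gesture sequence with an HMM, and layer rules informed by global statistics (such as average blink rate) on top. The following Python sketch is purely illustrative; the gesture states, prosodic symbols, and all probabilities are assumptions chosen for demonstration, not the authors' trained model.

```python
# Hypothetical sketch of a prosody-to-gesture mapping in the spirit of the
# paper's hybrid approach (HMM + rules + global statistics). All states,
# symbols, and probabilities are illustrative assumptions.
import numpy as np

GESTURES = ["none", "nod", "eyebrow_raise"]      # hidden states (assumed)
OBS = ["flat_low", "flat_high", "rise", "fall"]  # quantized prosodic symbols (assumed)

# Assumed parameters; in practice these would be trained from annotated data.
start = np.array([0.8, 0.1, 0.1])
trans = np.array([[0.7, 0.2, 0.1],
                  [0.4, 0.5, 0.1],
                  [0.4, 0.1, 0.5]])
emit = np.array([[0.5, 0.3, 0.1, 0.1],   # "none" mostly sees flat prosody
                 [0.1, 0.2, 0.2, 0.5],   # nods often align with pitch falls
                 [0.1, 0.2, 0.6, 0.1]])  # eyebrow raises align with pitch rises

def viterbi(obs_ids):
    """Most likely gesture sequence for a sequence of prosodic symbols."""
    n, T = len(GESTURES), len(obs_ids)
    logp = np.log(start) + np.log(emit[:, obs_ids[0]])
    back = np.zeros((T, n), dtype=int)
    for t in range(1, T):
        scores = logp[:, None] + np.log(trans)  # scores[i, j]: from state i to j
        back[t] = scores.argmax(axis=0)
        logp = scores.max(axis=0) + np.log(emit[:, obs_ids[t]])
    path = [int(logp.argmax())]
    for t in range(T - 1, 0, -1):
        path.append(back[t, path[-1]])
    return [GESTURES[s] for s in reversed(path)]

# Example: a hand-written syllable-level prosodic symbol stream (assumed input;
# a real system would derive this from a pitch/energy tracker in real time).
symbols = ["flat_low", "rise", "rise", "fall", "flat_high", "fall"]
gestures = viterbi([OBS.index(s) for s in symbols])

# Rule layer on top of the HMM output: periodic blinking driven by a global
# statistic (average blink rate), simplified here to a fixed interval.
for i, g in enumerate(gestures):
    blink = " + blink" if i % 4 == 0 else ""
    print(f"t={i}: {g}{blink}")
```

In a full system along the lines the paper describes, the transition and emission probabilities would be estimated from speech annotated with facial motion, and the rule and statistics layers would cover the remaining gesture types (gaze, head movements) that the HMM alone does not capture.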

Citation (APA)

Zoric, G., Forchheimer, R., & Pandzic, I. S. (2011). On creating multimodal virtual humans – real time speech driven facial gesturing. Multimedia Tools and Applications, 54(1), 165–179. https://doi.org/10.1007/s11042-010-0526-y
