Fusing audio-visual nonverbal cues to detect dominant people in small group conversations

28Citations
Citations of this article
45Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper addresses the multimodal nature of social dominance and presents multimodal fusion techniques to combine audio and visual nonverbal cues for dominance estimation in small group conversations. We combine the two modalities both at the feature extraction level and at the classifier level via score and rank level fusion. The classification is done by a simple rule-based estimator. We perform experiments on a new 10-hour dataset derived from the popular AMI meeting corpus. We objectively evaluate the performance of each modality and each cue alone and in combination. Our results show that the combination of audio and visual cues is necessary to achieve the best performance. © 2010 IEEE.

Cite

CITATION STYLE

APA

Aran, O., & Gatica-Perez, D. (2010). Fusing audio-visual nonverbal cues to detect dominant people in small group conversations. In Proceedings - International Conference on Pattern Recognition (pp. 3687–3690). https://doi.org/10.1109/ICPR.2010.898

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free