Multimodal HALEF: An open-source modular web-based multimodal dialog framework

0Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present an open-source web-based multimodal dialog framework, “Multimodal HALEF”, that integrates video conferencing and telephony abilities into the existing HALEF cloud-based dialog framework via the FreeSWITCH video telephony server. Due to its distributed and cloud-based architecture, Multimodal HALEF allows researchers to collect video and speech data from participants interacting with the dialog system outside of traditional lab settings, therefore largely reducing cost and labor incurred during the traditional audio-visual data collection process. The framework is equipped with a set of tools including a web-based user survey template, a speech transcription, an annotation and rating portal, a web visual processing server that performs head tracking, and a database that logs full-call audio and video recordings as well as other call-specific information. We present observations from an initial data collection based on an job interview application. Finally we report on some future plans for development of the framework.

Author supplied keywords

Cite

CITATION STYLE

APA

Yu, Z., Ramanarayanan, V., Mundkowsky, R., Lange, P., Ivanov, A., Black, A. W., & Suendermann-Oeft, D. (2017). Multimodal HALEF: An open-source modular web-based multimodal dialog framework. In Lecture Notes in Electrical Engineering (Vol. 427 427 LNEE, pp. 233–244). Springer Verlag. https://doi.org/10.1007/978-981-10-2585-3_18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free