Multimodal HALEF: An open-source modular web-based multimodal dialog framework

Zhou Yu; Vikram Ramanarayanan; Robert Mundkowsky; Patrick Lange; Alexei Ivanov; Alan W. Black; David Suendermann-Oeft

Conference Proceedings

Lecture Notes in Electrical Engineering (2017), vol. 427, pp. 233-244

DOI: 10.1007/978-981-10-2585-3_18

Abstract

We present an open-source web-based multimodal dialog framework, “Multimodal HALEF”, that integrates video conferencing and telephony abilities into the existing HALEF cloud-based dialog framework via the FreeSWITCH video telephony server. Because of its distributed, cloud-based architecture, Multimodal HALEF allows researchers to collect video and speech data from participants interacting with the dialog system outside of traditional lab settings, thereby substantially reducing the cost and labor incurred during the traditional audio-visual data collection process. The framework is equipped with a set of tools including a web-based user survey template, a portal for speech transcription, annotation, and rating, a web visual processing server that performs head tracking, and a database that logs full-call audio and video recordings as well as other call-specific information. We present observations from an initial data collection based on a job interview application. Finally, we report on some future plans for development of the framework.

Cite

Yu, Z., Ramanarayanan, V., Mundkowsky, R., Lange, P., Ivanov, A., Black, A. W., & Suendermann-Oeft, D. (2017). Multimodal HALEF: An open-source modular web-based multimodal dialog framework. In Lecture Notes in Electrical Engineering (Vol. 427, pp. 233–244). Springer Verlag. https://doi.org/10.1007/978-981-10-2585-3_18
