Geno: A developer tool for authoring multimodal interaction on existing web applications

Abstract

Supporting voice commands in applications presents significant benefits to users. However, as our formative study shows, adding such support to existing GUI-based web apps is labor-intensive and has a high learning barrier, because unified support for creating multimodal interfaces is lacking. We develop Geno, a developer tool for adding the voice input modality to existing web apps without requiring significant NLP expertise. Geno provides a unified workflow for developers to specify the functionalities to support by voice (intents), create language models for detecting intents and the relevant information (parameters) from user utterances, and fulfill the intents by either programmatically invoking the corresponding functions or replaying GUI actions on the web app. Geno further supports references to GUI context in voice commands (e.g., "add this to the playlist"). In a study, developers with little NLP expertise were able to add multimodal support to two existing web apps using Geno.
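The intent, parameter, and fulfillment workflow described in the abstract can be illustrated with a minimal TypeScript sketch. This is not Geno's API: every name here (`Intent`, `GuiContext`, `handleUtterance`, the `addToPlaylist` intent) is hypothetical, a regular expression stands in for a trained language model, and a plain `selectedText` field stands in for GUI context. Only fulfillment by function invocation is shown; replaying GUI actions is omitted.

```typescript
// Hypothetical sketch; none of these names come from Geno itself.
type Params = Record<string, string>;

interface Intent {
  name: string;
  // Stand-in for a trained language model: a regex whose capture
  // groups map, in order, to the names listed in paramNames.
  pattern: RegExp;
  paramNames: string[];
  // Fulfillment: programmatically invoke an existing app function.
  fulfill: (params: Params) => string;
}

// Stand-in for GUI context, e.g. the item the user is pointing at.
interface GuiContext {
  selectedText?: string;
}

// Stand-in for the existing web app's state.
const playlist: string[] = [];

const intents: Intent[] = [
  {
    name: "addToPlaylist",
    pattern: /^add (.+) to the playlist$/i,
    paramNames: ["song"],
    fulfill: (params) => {
      const song = params["song"] ?? "";
      playlist.push(song);
      return `added ${song}`;
    },
  },
];

// Detects the intent, extracts parameters, resolves "this" against
// the GUI context, and fulfills the intent. Returns null when no
// intent matches or a reference cannot be resolved.
function handleUtterance(
  utterance: string,
  context: GuiContext = {}
): string | null {
  for (const intent of intents) {
    const match = intent.pattern.exec(utterance.trim());
    if (match === null) continue;

    const params: Params = {};
    for (let i = 0; i < intent.paramNames.length; i++) {
      const name = intent.paramNames[i] ?? "";
      let value = match[i + 1] ?? "";
      if (value.toLowerCase() === "this") {
        if (context.selectedText === undefined) return null;
        value = context.selectedText;
      }
      params[name] = value;
    }
    return intent.fulfill(params);
  }
  return null;
}
```

Under these assumptions, `handleUtterance("add this to the playlist", { selectedText: "Hey Jude" })` resolves the reference from the GUI context before invoking the app function, while the same utterance without a selection is rejected rather than guessed at.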

Cite

Sarmah, R. J., Ding, Y., Wang, D., Lee, C. Y. P., Li, T. J. J., & Chen, X. A. (2020). Geno: A developer tool for authoring multimodal interaction on existing web applications. In UIST 2020 - Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology (pp. 1169–1181). Association for Computing Machinery, Inc. https://doi.org/10.1145/3379337.3415848
