Abstract
Multimodal conversational agents are an ever expanding field which benefits from the introduction of large language models. Production-ready robust conversational assistants trade breadth of scope for higher accuracy and general dialogue quality. These conversational assistants must be able to maintain the conversation focused, respond appropriately to user requests, maintain a certain level of natural response generation, be robust to out-of-scope and chitchat attempts, and, of course, be accurate in assisting the user in reaching their domain-specific goals. This work discusses data-centric observations, alongside providing research hypothesis for future, and some of my already developed work, to be expanded throughout my PhD.
Author supplied keywords
Cite
CITATION STYLE
Tavares, D. (2022). Zero-shot Generalization of Multimodal Dialogue Agents. In MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia (pp. 6935–6939). Association for Computing Machinery, Inc. https://doi.org/10.1145/3503161.3548759
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.