Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models

Wout Schellaert; Fernando Martínez-Plumed; Karina Vold; John Burden; Pablo A.M. Casares; Bao Sheng Loe; Roi Reichart; Seán Ó hÉigeartaigh; Anna Korhonen; José Hernández-Orallo

Journal article, open access

Journal of Artificial Intelligence Research (2023) 77 377-394

DOI: 10.1613/jair.1.14157

Abstract

Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools representing an unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive "human-centred generality" (HCG), rather than a fully autonomous one. HCG implies that, for a specific user, a system is only as general as it is effective for the user's relevant tasks and their prevalent ways of prompting. A human-centred evaluation of general-purpose AI systems therefore needs to reflect the personal nature of interaction, tasks and cognition. We argue that the best way to understand these systems is as highly-coupled cognitive extenders, and to analyse the bidirectional cognitive adaptations between them and humans. In this paper, we give a formulation of HCG, as well as a high-level overview of the elements and trade-offs involved in the prompting process. We end the paper by outlining some essential research questions and suggestions for improving evaluation practices, which we envision as characteristic of the evaluation of general artificial intelligence in the future.

Schellaert, W., Martínez-Plumed, F., Vold, K., Burden, J., Casares, P. A. M., Loe, B. S., … Hernández-Orallo, J. (2023). Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models. Journal of Artificial Intelligence Research, 77, 377–394. https://doi.org/10.1613/jair.1.14157