Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models

Wout Schellaert; Fernando Martínez-Plumed; Karina Vold; John Burden; Pablo A. M. Casares; Bao Sheng Loe; Roi Reichart; Sean Ó hÉigeartaigh; Anna Korhonen; José Hernández-Orallo

doi:10.1613/jair.1.14157

PDF

Published: Jun 12, 2023

DOI: https://doi.org/10.1613/jair.1.14157

Keywords:

human computer interaction, cognitive modelling

Wout Schellaert

https://orcid.org/0000-0002-9182-4747

Fernando Martínez-Plumed

Karina Vold

John Burden

Pablo A. M. Casares

Bao Sheng Loe

Roi Reichart

Sean Ó hÉigeartaigh

Anna Korhonen

José Hernández-Orallo

Abstract

Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools representing an unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive “human-centred generality” (HCG), rather than a fully autonomous one. HCG implies that —for a specific user— a system is only as general as it is effective for the user’s relevant tasks and their prevalent ways of prompting. A human-centred evaluation of general-purpose AI systems therefore needs to reflect the personal nature of interaction, tasks and cognition. We argue that the best way to understand these systems is as highly-coupled cognitive extenders, and to analyse the bidirectional cognitive adaptations between them and humans. In this paper, we give a formulation of HCG, as well as a high-level overview of the elements and trade-offs involved in the prompting process. We end the paper by outlining some essential research questions and suggestions for improving evaluation practices, which we envision as characteristic for the evaluation of general artificial intelligence in the future.

This paper appears in the AI & Society track.

Issue

Vol. 77 (2023)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details