Generalization guides human exploration in vast decision spaces

Charley M. Wu; Eric Schulz; Maarten Speekenbrink; Jonathan D. Nelson; Björn Meder

Article

Generalization guides human exploration in vast decision spaces

Nature Human Behaviour

DOI: 10.1038/s41562-018-0467-4

172Citations

301Readers

Get full text

Abstract

From foraging for food to learning complex games, many aspects of human behaviour can be framed as a search problem with a vast space of possible actions. Under finite search horizons, optimal solutions are generally unobtainable. Yet, how do humans navigate vast problem spaces, which require intelligent exploration of unobserved actions? Using various bandit tasks with up to 121 arms, we study how humans search for rewards under limited search horizons, in which the spatial correlation of rewards (in both generated and natural environments) provides traction for generalization. Across various different probabilistic and heuristic models, we find evidence that Gaussian process function learning—combined with an optimistic upper confidence bound sampling strategy—provides a robust account of how people use generalization to guide search. Our modelling results and parameter estimates are recoverable and can be used to simulate human-like performance, providing insights about human behaviour in complex environments.

Cite

CITATION STYLE

APA

Wu, C. M., Schulz, E., Speekenbrink, M., Nelson, J. D., & Meder, B. (2018, December 1). Generalization guides human exploration in vast decision spaces. Nature Human Behaviour. Nature Publishing Group. https://doi.org/10.1038/s41562-018-0467-4

Generalization guides human exploration in vast decision spaces

Abstract

Cite

Register to see more suggestions