We present a novel behavioural task for the investigation of how actions are added to an agent's repertoire. In the task, free exploration of the range of possible movements with a manipulandum, such as a joystick, is recorded. A subset of these movements trigger a reinforcing signal. Our interest is in how those elements of total behaviour which cause an unexpected outcome are identified and stored. This process is necessarily prior to the attachment of value to different actions [Redgrave, P., Gurney, K.: The short-latency dopamine signal: A role in discovering novel actions? Nat. Rev. Neurosci. 7(12), 967-975 (2006)]. The task allows for critical tests of reinforcement prediction error theories [e.g. Schultz, W., Dayan, P., Montague, P.: A neural substrate of prediction and reward. Science 275, 1593-1599 (1997)], as well as providing a window on a number of other issues in action learning. The task provides a paradigm where the exploratory motive drives learning, and as such we view it as in the tradition of Thorndike [Animal intelligence (1911)]. Our task is easily scalable in difficulty, is adaptable across species and provides a rich set of behavioural measures throughout the action-learning process. Targets can be defined in spatial, temporal or kinematic/gestural terms, and the task also allows the concatenation of actions to be investigated. Action learning requires integration across spatial, kinematic and temporal dimensions. The task affords insight into these (and into the process of integration).
CITATION STYLE
Stafford, T., Walton, T., Hetherington, L., Thirkettle, M., Gurney, K., & Redgrave, P. (2013). A novel behavioural task for researching intrinsic motivations. In Intrinsically Motivated Learning in Natural and Artificial Systems (Vol. 9783642323751, pp. 395–410). Springer-Verlag Berlin Heidelberg. https://doi.org/10.1007/978-3-642-32375-1_15
Mendeley helps you to discover research relevant for your work.