We introduce a biologically motivated, formal framework or ontology for dealing with many aspects of action discovery which we argue is an example of intrinsically motivated behaviour (as such, this chapter is a companion to that by Redgrave et al. in this volume). We argue that action discovery requires an interplay between separate internal forward models of prediction and inverse models mapping outcomes to actions. The process of learning actions is driven by transient changes in the animal's policy (repetition bias) which is, in turn, a result of unpredicted, phasic sensory information (surprise). The notion of salience as value is introduced and broken down into contributions from novelty (or surprise), immediate reward acquisition, or general task/goal attainment. Many other aspects of biological action discovery emerge naturally in our framework which aims to guide future modelling efforts in this domain.
CITATION STYLE
Gurney, K., Lepora, N., Shah, A., Koene, A., & Redgrave, P. (2013). Action discovery and intrinsic motivation: A biologically constrained formalisation. In Intrinsically Motivated Learning in Natural and Artificial Systems (Vol. 9783642323751, pp. 151–181). Springer-Verlag Berlin Heidelberg. https://doi.org/10.1007/978-3-642-32375-1_7
Mendeley helps you to discover research relevant for your work.