Temporal difference models describe higher-order learning in humans

Seymour, Ben; O'Doherty, John P.; Dayan, Peter; Koltzenburg, Martin; Jones, Anthony K.; Dolan, Raymond J.; Friston, Karl J.; Frackowiak, Richard S.

Published in

Nature Research, Nature, 6992(429), p. 664-667, 2004

DOI: 10.1038/nature02581

Tools

Export citation

Search in Google Scholar

Temporal difference models describe higher-order learning in humans

Journal article published in 2004 by Ben Seymour, John P. O'Doherty, Peter Dayan, Martin Koltzenburg

, Anthony K. Jones, Raymond J. Dolan, Karl J. Friston, Richard S. Frackowiak

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

The ability to use environmental stimuli to predict impending harm is critical for survival. Such predictions should be available as early as they are reliable. In pavlovian conditioning, chains of successively earlier predictors are studied in terms of higher-order relationships, and have inspired computational theories such as temporal difference learning1. However, there is at present no adequate neurobiological account of how this learning occurs. Here, in a functional magnetic resonance imaging (fMRI) study of higher-order aversive conditioning, we describe a key computational strategy that humans use to learn predictions about pain. We show that neural activity in the ventral striatum and the anterior insula displays a marked correspondence to the signals for sequential learning predicted by temporal difference models. This result reveals a flexible aversive learning process ideally suited to the changing and uncertain nature of real-world environments. Taken with existing data on reward learning2, our results suggest a critical role for the ventral striatum in integrating complex appetitive and aversive predictions to coordinate behaviour.

Published in

Links

Tools

Temporal difference models describe higher-order learning in humans

Abstract