Dopamine, reward learning, and active inference

FitzGerald, Thomas H. B.; Dolan, Raymond J.; Friston, Karl

Published in

Frontiers Media, Frontiers in Computational Neuroscience, (9), 2015

DOI: 10.3389/fncom.2015.00136

Journal article published in 2015 by Thomas H. B. FitzGerald, Raymond J. Dolan, Karl Friston

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving allowed

Abstract

Temporal difference learning models propose that phasic dopamine signaling encodes reward prediction errors that drive learning. This is supported by studies where optogenetic stimulation of dopamine neurons can stand in lieu of actual reward. Nevertheless, a large body of data also shows that dopamine is not necessary for learning, and that dopamine depletion primarily affects task performance. We offer a resolution to this paradox based on a hypothesis that dopamine encodes the precision of beliefs about alternative actions, and thus controls the outcome-sensitivity of behavior. We extend an active inference scheme for solving Markov decision processes to include learning, and show that simulated dopamine dynamics strongly resemble those actually observed during instrumental conditioning. Furthermore, simulated dopamine depletion impairs performance but spares learning, while simulated excitation of dopamine neurons drives reward learning, through aberrant inference about outcome states. Our formal approach provides a novel and parsimonious reconciliation of apparently divergent experimental findings.
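
The idea that precision controls the outcome-sensitivity of behavior can be illustrated with a minimal sketch. It is not the authors' model: it assumes that action selection is a softmax over policy values scaled by a single precision parameter, and the function name, the two policy values, and the precision settings are hypothetical choices made only for illustration.

```python
import math

def policy_probabilities(values, precision):
    """Softmax over policy values, scaled by a precision parameter.

    Assumption for illustration: higher precision makes choice more
    sensitive to differences in value between policies.
    """
    scaled = [precision * v for v in values]
    m = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical learned values for two policies (first one is better).
values = [1.0, 0.0]

# Same learned values, different precision (illustrative settings).
low = policy_probabilities(values, precision=0.1)
high = policy_probabilities(values, precision=4.0)

print("low precision: ", low)
print("high precision:", high)
```

In this sketch the learned values are identical in both cases; only the precision differs. With low precision the better policy is chosen only slightly more often than chance, which is one way to picture impaired performance with learning left intact.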
