Journal
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE
Volume 5, Issue -, Pages -
Publisher
FRONTIERS RES FOUND
DOI: 10.3389/fncom.2011.00021
Keywords
reinforcement learning; dopamine; STDP; neuronal excitability; prefrontal cortex; basal ganglia
Funding
- EPSRC [EP/C537912/1, EP/G007543/1]
- Dr. Mortimer and Theresa Sackler Foundation
- EPSRC [EP/G007543/1] Funding Source: UKRI
- Engineering and Physical Sciences Research Council [EP/G007543/1] Funding Source: researchfish
Abstract
Dopaminergic neurons in the mammalian substantia nigra display characteristic phasic responses to stimuli that reliably predict the receipt of primary rewards. These responses have been suggested to encode reward prediction-errors similar to those used in reinforcement learning. Here, we propose a model of dopaminergic activity in which prediction-error signals are generated by the joint action of short-latency excitation and long-latency inhibition, in a network undergoing dopaminergic neuromodulation of both spike-timing-dependent synaptic plasticity and neuronal excitability. In contrast to previous models, sensitivity to recent events is maintained by the selective modification of specific striatal synapses, efferent to cortical neurons exhibiting stimulus-specific, temporally extended activity patterns. Our model shows, in the presence of significant background activity, (i) a shift in dopaminergic response from reward to reward-predicting stimuli, (ii) preservation of a response to unexpected rewards, and (iii) a precisely timed below-baseline dip in activity observed when expected rewards are omitted.
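For readers unfamiliar with the reinforcement-learning notion of a reward prediction-error, the sketch below may help. It is a textbook temporal-difference (TD) learner with a tapped-delay-line stimulus representation, not the spiking network proposed in this paper, and it is offered only to illustrate the kind of signal the three listed behaviours refer to. All names and parameter values (`run_trial`, `T_CUE`, `T_REW`, `alpha`, `gamma`) are assumptions made for this illustration.

```python
import numpy as np

def run_trial(w, t_cue, t_rew, n_steps, reward=1.0, alpha=0.3, gamma=0.98, learn=True):
    """Run one trial and return the TD error at every time step.

    w[k] is the predicted future reward k steps after cue onset
    (hypothetical tapped-delay-line representation).
    """
    def value(t):
        # Before the cue there is no stimulus trace, so the prediction is zero.
        return w[t - t_cue] if t >= t_cue else 0.0

    delta = np.zeros(n_steps)
    for t in range(n_steps):
        r = reward if t == t_rew else 0.0
        # Prediction error on arriving at step t.
        delta[t] = r + gamma * value(t) - value(t - 1)
        # Update the prediction made at the previous step, if the cue was on.
        if learn and t - 1 >= t_cue:
            w[t - 1 - t_cue] += alpha * delta[t]
    return delta

T_CUE, T_REW, N_STEPS = 5, 15, 20
w = np.zeros(N_STEPS - T_CUE)

# Untrained learner: the reward is unexpected.
before = run_trial(w, T_CUE, T_REW, N_STEPS, learn=False)

# Repeated cue-reward pairings.
for _ in range(500):
    run_trial(w, T_CUE, T_REW, N_STEPS)

# Trained learner: reward delivered, then reward omitted.
after = run_trial(w, T_CUE, T_REW, N_STEPS, learn=False)
omitted = run_trial(w, T_CUE, T_REW, N_STEPS, reward=0.0, learn=False)

print("untrained, error at reward:", before[T_REW])
print("trained, error at cue:", after[T_CUE])
print("trained, error at reward:", after[T_REW])
print("trained, error at omitted reward:", omitted[T_REW])
```

In this toy setting the error is positive at the reward before training, moves to cue onset after training, and becomes negative at the expected reward time when the reward is withheld. This corresponds only loosely to behaviours (i) to (iii) above; the paper's model produces them through short-latency excitation, long-latency inhibition and dopamine-modulated plasticity rather than through an explicit TD update.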