Article

Model-based reinforcement learning under concurrent schedules of reinforcement in rodents

Journal

LEARNING & MEMORY
Volume 16, Issue 5, Pages 315-323

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/lm.1295509

Funding

  1. Korea Research Foundation [KRF-2005-216-E00058]
  2. Department of Medical Sciences, the Graduate School, Ajou University
  3. Korea Science and Engineering Foundation [R01-2008-000-10287-0]
  4. Korea Healthcare Technology Research and Development [A080742]
  5. Cognitive Neuroscience Program of the Korea Ministry of Science and Technology
  6. National Research Foundation of Korea [2006-2005112, 2006-2005110, 2005-216-E00058, 2008-0057446, R01-2008-000-10287-0] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Reinforcement learning theories postulate that actions are chosen to maximize a long-term sum of positive outcomes based on value functions, which are subjective estimates of future rewards. In simple reinforcement learning algorithms, value functions are updated only by trial-and-error, whereas they are updated according to the decision-maker's knowledge or model of the environment in model-based reinforcement learning algorithms. To investigate how animals update value functions, we trained rats under two different free-choice tasks. The reward probability of the unchosen target remained unchanged in one task, whereas it increased over time since the target was last chosen in the other task. The results show that goal choice probability increased as a function of the number of consecutive alternative choices in the latter, but not the former task, indicating that the animals were aware of time-dependent increases in arming probability and used this information in choosing goals. In addition, the choice behavior in the latter task was better accounted for by a model-based reinforcement learning algorithm. Our results show that rats adopt a decision-making process that cannot be accounted for by simple reinforcement learning models even in a relatively simple binary choice task, suggesting that rats can readily improve their decision-making strategy through the knowledge of their environments.
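The contrast the abstract draws between the two kinds of value update can be illustrated with a minimal Python sketch. It is not the authors' fitted model. It assumes, hypothetically, that in the second task an unchosen target becomes armed with a fixed per-trial probability `p_arm` and stays armed until it is chosen, so that its reward probability after `k` trials is `1 - (1 - p_arm)**k`. The function names, the learning rate `alpha`, and the example numbers are all illustrative assumptions.

```python
from typing import List


def simple_update(values: List[float], chosen: int, reward: float,
                  alpha: float = 0.1) -> List[float]:
    """Trial-and-error update: only the chosen target's value changes."""
    updated = list(values)
    updated[chosen] += alpha * (reward - updated[chosen])
    return updated


def armed_probability(p_arm: float, trials_since_chosen: int) -> float:
    """Assumed task model: the target arms with probability p_arm on each
    trial and stays armed until chosen, so the chance that it is armed
    grows with the number of trials since it was last chosen."""
    return 1.0 - (1.0 - p_arm) ** trials_since_chosen


def model_based_values(p_arm_estimates: List[float],
                       trials_since_chosen: List[int]) -> List[float]:
    """Model-based values: each target's value is its predicted reward
    probability under the assumed arming model, so the value of an
    unchosen target rises even though it has not been sampled."""
    return [armed_probability(p, k)
            for p, k in zip(p_arm_estimates, trials_since_chosen)]


if __name__ == "__main__":
    # Simple update: the unchosen target (index 1) keeps its old value.
    print(simple_update([0.5, 0.5], chosen=0, reward=1.0))
    # Model-based: target 1 has gone unchosen for 3 trials, so its value rises.
    print(model_based_values([0.2, 0.2], [1, 3]))
```

Under these assumptions, a simple learner has no reason to switch after a run of choices on one target, whereas the model-based learner's estimate for the neglected target climbs with each trial, which is the qualitative pattern the abstract reports for the second task.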
