Article
Computer Science, Artificial Intelligence
Rex G. Liu, Michael J. Frank
Summary: A hallmark of human intelligence is the ability to compositionally generalize, which is challenging for reinforcement learning agents. This study proposes a hierarchical RL agent that learns and transfers task components and structures using a non-parametric Bayesian model, maintaining a factorized representation of task components through a hierarchical Dirichlet process.
ARTIFICIAL INTELLIGENCE
(2022)
Article
Robotics
Mukund Shah, Niravkumar Patel
Summary: This study introduces a flexible needle path planning algorithm that uses deep reinforcement learning methods to generate a kinematically feasible path for minimally invasive neurosurgery. The reinforcement learning algorithms are trained on segmented images of ventricles, blood vessels, and tumors, and the generated paths are compared with a traditional sampling-based algorithm. The results demonstrate that the proposed framework produces safer, shorter, and faster paths while avoiding critical structures.
IEEE ROBOTICS AND AUTOMATION LETTERS
(2023)
Article
Engineering, Civil
Szilard Aradi
Summary: Academic research in the field of autonomous vehicles has gained popularity in recent years, covering various topics such as sensor technologies, communication, safety, decision making, and control. Artificial Intelligence and Machine Learning methods have become integral parts of this research. Motion planning, with a focus on strategic decision-making, trajectory planning, and control, has also been studied. This article specifically explores Deep Reinforcement Learning (DRL) as a field within Machine Learning. The paper provides insights into hierarchical motion planning and the basics of DRL, including environment modeling, state representation, perception models, reward mechanisms, and neural network implementation. It also discusses vehicle models, simulation possibilities, and computational requirements. The paper surveys state-of-the-art solutions, categorized by different tasks and levels of autonomous driving, such as car-following, lane-keeping, trajectory following, merging, and driving in dense traffic. Lastly, it raises open questions and future challenges.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
(2022)
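Among the DRL building blocks this survey covers is the design of reward mechanisms. As a purely illustrative sketch (the weights, signal names, and functional form below are invented, not taken from the survey), a dense lane-keeping reward might penalize lateral offset, heading error, and speed deviation:

```python
def lane_keeping_reward(lateral_offset, heading_error, speed,
                        target_speed=30.0, w_off=1.0, w_head=0.5, w_speed=0.1):
    """Toy dense reward: penalize lateral offset from lane center (m),
    heading error (rad), and deviation from a target speed (m/s).
    Weights and form are illustrative assumptions only."""
    return -(w_off * abs(lateral_offset)
             + w_head * abs(heading_error)
             + w_speed * abs(speed - target_speed))

# Sloppier driving earns a lower (more negative) reward than clean tracking.
print(lane_keeping_reward(0.5, 0.1, 28.0) < lane_keeping_reward(0.1, 0.0, 30.0))
```

Shaped, dense rewards of this kind are one of the design choices the survey contrasts with sparse, outcome-only rewards.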
Review
Engineering, Industrial
Ana Esteso, David Peidro, Josefa Mula, Manuel Diaz-Madronero
Summary: The objective of this paper is to examine the use and applications of reinforcement learning techniques in the field of production planning and control. The analysis reviewed 181 relevant articles, evaluating the characteristics of the reinforcement learning methods, the number of agents, and the software tools used. The results show that reinforcement learning is mainly applied to production scheduling problems and yields promising results, especially under uncertainty or non-linear properties.
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH
(2023)
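To illustrate the kind of method this review surveys, here is a minimal tabular Q-learning loop for a toy two-job scheduling problem (the jobs, weights, and hyperparameters are invented for illustration and not drawn from the review):

```python
import random

# Toy scheduling MDP: at each step we pick one remaining job to run next;
# the reward is the negative weighted completion time of that job.
JOBS = {"A": (3, 2), "B": (1, 5)}  # job -> (processing time, weight)

def step(state, elapsed, job):
    """Run `job`; return (remaining jobs, new elapsed time, reward)."""
    proc, weight = JOBS[job]
    finish = elapsed + proc
    return state - {job}, finish, -weight * finish

def q_learning(episodes=500, alpha=0.5, gamma=1.0, eps=0.2, seed=0):
    rng = random.Random(seed)
    Q = {}
    for _ in range(episodes):
        state, elapsed = frozenset(JOBS), 0
        while state:
            actions = sorted(state)
            if rng.random() < eps:  # epsilon-greedy exploration
                a = rng.choice(actions)
            else:
                a = max(actions, key=lambda j: Q.get((state, j), 0.0))
            nxt, elapsed2, r = step(state, elapsed, a)
            best_next = max((Q.get((nxt, j), 0.0) for j in nxt), default=0.0)
            Q[(state, a)] = Q.get((state, a), 0.0) + alpha * (
                r + gamma * best_next - Q.get((state, a), 0.0))
            state, elapsed = nxt, elapsed2
    return Q

Q = q_learning()
start = frozenset(JOBS)
greedy_first = max(sorted(start), key=lambda j: Q.get((start, j), 0.0))
print(greedy_first)  # the learned policy runs the short, heavy job "B" first
```

The learned greedy policy recovers the weighted-shortest-processing-time order, the known optimum for this toy objective; real production-scheduling applications in the review replace the table with function approximators and far richer state spaces.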
Article
Multidisciplinary Sciences
Thomas McGrath, Andrei Kapishnikov, Nenad Tomasev, Adam Pearce, Martin Wattenberg, Demis Hassabis, Been Kim, Ulrich Paquet, Vladimir Kramnik
Summary: AlphaZero, a neural network engine that learns chess solely by playing against itself, acquires knowledge that enables it to outperform human chess players. Despite training without access to human games or guidance, it appears to learn concepts similar to those used by human chess players, as evidenced by linear probes and behavioral analysis.
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
(2022)
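A linear probe of the kind mentioned above simply fits a linear classifier on a network's internal activations to test whether a human concept is linearly decodable from them. A minimal perceptron-style sketch on synthetic "activations" (all data and the concept here are invented for illustration; the study's actual probes were fit on AlphaZero's activations):

```python
import random

def train_linear_probe(acts, labels, epochs=20, lr=0.1):
    """Perceptron probe: learn w, b so sign(w.x + b) predicts the concept."""
    dim = len(acts[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        for x, y in zip(acts, labels):      # y in {-1, +1}
            pred = 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1
            if pred != y:                   # mistake-driven update
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b

# Synthetic activations: the "concept" is linearly encoded in coordinate 0.
rng = random.Random(0)
acts = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(200)]
acts = [a for a in acts if abs(a[0]) > 0.2]  # keep a margin so the probe converges
labels = [1 if a[0] > 0 else -1 for a in acts]
w, b = train_linear_probe(acts, labels)
acc = sum((1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1) == y
          for x, y in zip(acts, labels)) / len(acts)
print(f"probe accuracy: {acc:.2f}")
```

High probe accuracy is evidence that the concept is (linearly) represented; the behavioral analyses in the study complement this by checking whether play is consistent with the concept.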
Article
Chemistry, Multidisciplinary
Tomoya Kawabe, Tatsushi Nishi, Ziang Liu
Summary: The use of multiple mobile robots has increased in logistics, manufacturing, and public services. Conflict-free route planning is a major research challenge for these robots. This article proposes a flexible route planning method that combines reinforcement learning and graph search algorithms for multiple robots.
APPLIED SCIENCES-BASEL
(2023)
Article
Computer Science, Artificial Intelligence
Yixing Lan, Xin Xu, Qiang Fang, Yujun Zeng, Xinwang Liu, Xianjian Zhang
Summary: This paper proposes a transfer reinforcement learning approach using auto-pruned decision trees for meta-knowledge extraction. Pre-trained policies are learned in source MDPs using RL algorithms, and meta-knowledge is extracted by re-training an auto-pruned decision tree model. In target MDPs, a hybrid policy integrating meta-knowledge and policies learned on the target MDPs is generated. Experimental results demonstrate that the proposed approach outperforms other baselines in terms of learning efficiency and interpretability.
KNOWLEDGE-BASED SYSTEMS
(2022)
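The meta-knowledge extraction step above distills a trained policy into a small, interpretable tree. As a hedged sketch of the idea (a hand-rolled one-split stump rather than the paper's auto-pruned trees; the teacher policy and features are invented), one can fit the simplest such model to (state, action) pairs sampled from a source-task policy:

```python
def majority(labels):
    """Most frequent label, or None for an empty split."""
    return max(set(labels), key=labels.count) if labels else None

def fit_stump(states, actions):
    """One-split decision stump distilled from teacher (state, action) pairs."""
    best = (-1, None)  # (num correct, (feature, threshold, left_act, right_act))
    for feat in range(len(states[0])):
        for thr in sorted({s[feat] for s in states}):
            left = [a for s, a in zip(states, actions) if s[feat] <= thr]
            right = [a for s, a in zip(states, actions) if s[feat] > thr]
            la, ra = majority(left), majority(right)
            correct = left.count(la) + right.count(ra)
            if correct > best[0]:
                best = (correct, (feat, thr, la, ra))
    return best[1]

def stump_policy(stump, state):
    feat, thr, la, ra = stump
    return la if state[feat] <= thr else ra

# Invented teacher behavior: brake when feature 0 (say, speed) exceeds ~0.5.
states = [(0.1, 0.0), (0.3, 1.0), (0.6, 0.2), (0.9, 0.8)]
actions = ["accel", "accel", "brake", "brake"]
stump = fit_stump(states, actions)
print(stump_policy(stump, (0.7, 0.0)))  # mimics the teacher: "brake"
```

In the transfer setting described above, a distilled model like this supplies a compact prior that is blended with policies learned directly on the target MDPs.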
Article
Computer Science, Artificial Intelligence
Lin Li, Wanzhong Zhao, Chunyan Wang, Abbas Fotouhi, Xuze Liu
Summary: In this study, an interactive merging strategy based on multi-agent deep reinforcement learning is proposed to coordinate the merging behaviors between ramp and mainline vehicles. By considering the dynamic reaction of mainline vehicles, the strategy successfully improves driving performance.
EXPERT SYSTEMS WITH APPLICATIONS
(2024)
Article
Automation & Control Systems
Junhyoung Ha, Byungchul An, Soonkyum Kim
Summary: In graph search algorithms, an environment is represented as a graph with system configurations and connections. A path connecting initial and goal configurations is generated by exploring the graph, aiming for optimality. Heuristic functions are used to guide the exploration efficiently. We propose RLHA*, a reinforcement learning approach using an artificial neural network as a heuristic function, to estimate optimal cost and achieve a suboptimal path. RLHA* outperforms existing methods in numerous simulations, demonstrating its consistent and robust performance.
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
(2023)
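The key idea above is that the heuristic in graph search is just a cost-to-go estimator, so a learned model can be dropped in where a hand-coded function normally sits. A minimal A* sketch with a pluggable heuristic (this is an illustrative assumption of the interface, not the paper's implementation; Manhattan distance stands in for the trained network):

```python
import heapq

def a_star(start, goal, neighbors, heuristic):
    """Plain A*; `heuristic` is any callable estimating cost-to-go, so a
    trained network's prediction could be plugged in unchanged."""
    frontier = [(heuristic(start, goal), 0, start, [start])]
    seen = {}
    while frontier:
        f, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path
        if g >= seen.get(node, float("inf")):
            continue  # stale entry: already expanded with cheaper cost
        seen[node] = g
        for nxt, cost in neighbors(node):
            heapq.heappush(frontier, (g + cost + heuristic(nxt, goal),
                                      g + cost, nxt, path + [nxt]))
    return None

def grid_neighbors(node):
    """4-connected moves on a 5x5 grid with cell (2, 2) blocked."""
    x, y = node
    for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nx, ny = x + dx, y + dy
        if 0 <= nx < 5 and 0 <= ny < 5 and (nx, ny) != (2, 2):
            yield (nx, ny), 1

manhattan = lambda a, b: abs(a[0] - b[0]) + abs(a[1] - b[1])
path = a_star((0, 0), (4, 4), grid_neighbors, manhattan)
print(len(path) - 1)  # → 8 moves around the blocked cell
```

A learned heuristic trades admissibility guarantees for accuracy, which is why approaches like the one summarized above target suboptimal-but-good paths with far fewer node expansions.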
Article
Construction & Building Technology
Sourav Dey, Thibault Marzullo, Gregor Henze
Summary: Reinforcement learning (RL) based control is a promising approach in building automation and control. However, it suffers from long training times and unstable control behavior during the early stages of learning, making it unsuitable for direct application to buildings. This paper addresses these issues using inverse reinforcement learning (IRL), which learns the objective of a controller agent that is considered an expert in its domain.
ENERGY AND BUILDINGS
(2023)
Article
Computer Science, Artificial Intelligence
David Silver, Satinder Singh, Doina Precup, Richard S. Sutton
Summary: This article advances the hypothesis that intelligence, and the abilities associated with it, serve the maximization of reward, with reward-driven behavior giving rise to those abilities. It suggests that agents learning through trial and error to maximize reward could exhibit most, if not all, of these abilities, and that reward maximization could therefore constitute a path to artificial general intelligence.
ARTIFICIAL INTELLIGENCE
(2021)
Article
Neurosciences
Leonie Glitz, Keno Juechems, Christopher Summerfield, Neil Garrett
Summary: This study investigates the computational and neural mechanisms of information sharing and segmentation in planning. The findings suggest that human participants can efficiently switch between low-dimensional and high-dimensional representations of state transitions during planning tasks. The medial temporal lobe is identified as a key region for learning state transitions. The study also reveals that transition models are more strongly updated after positive outcomes.
JOURNAL OF NEUROSCIENCE
(2022)
Article
Computer Science, Artificial Intelligence
Zirui Xu, George P. Kontoudis, Kyriakos G. Vamvoudakis
Summary: We propose RRT-Q*X, an online and intermittent kinodynamic motion planning framework for dynamic environments with unknown robot dynamics and unknown disturbances. The framework leverages RRTX for global path planning and rapid replanning to generate waypoints as a sequence of boundary-value problems (BVPs). We introduce a robust intermittent Q-learning controller for waypoint navigation with completely unknown system dynamics, external disturbances, and intermittent control updates. We demonstrate the effectiveness of RRT-Q*X through Monte Carlo numerical experiments in various dynamic and changing environments.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
(2023)
Article
Telecommunications
Federico Venturini, Federico Mason, Francesco Pase, Federico Chiariotti, Alberto Testolin, Andrea Zanella, Michele Zorzi
Summary: This study proposes a distributed reinforcement learning approach that scales to larger swarms without modification. The method relies on UAVs exchanging information over a communication channel to achieve context awareness and coordinate swarm actions. Experimental results show that the proposed method outperforms traditional look-ahead heuristics in handling non-uniform distributions of targets and obstacles.
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING
(2021)
Article
Computer Science, Artificial Intelligence
Alexandre Heuillet, Fabien Couthouis, Natalia Diaz-Rodriguez
Summary: The study surveys the development of Explainable Reinforcement Learning (XRL) and the application of XAI techniques to understanding the behavior and internal workings of reinforcement learning models. The review focuses on studies directly linking explainability to RL, categorizing explanation generation into transparent algorithms and post-hoc explainability. It also reviews prominent XAI works and their potential impact on the latest advances in RL and on present and future real-world problems.
KNOWLEDGE-BASED SYSTEMS
(2021)
Article
Multidisciplinary Sciences
Johan Lind, Stefano Ghirlanda, Magnus Enquist
ROYAL SOCIETY OPEN SCIENCE
(2019)
Editorial Material
Psychology, Biological
Johan Lind
Summary: This Outlook article discusses a study by Wasserman, Kain, and O'Donoghue that resolves the associative learning paradox by demonstrating that pigeons can solve complex category-learning tasks through associative learning, and it expands on this paradox and the implications of those results.
LEARNING & BEHAVIOR
(2023)
Article
Multidisciplinary Sciences
Johan Lind, Vera Vinken, Markus Jonsson, Stefano Ghirlanda, Magnus Enquist
Summary: Identifying cognitive capacities underlying the human evolutionary transition is challenging. Recent studies suggest key differences in how humans and other animals recognize and remember information. This study tests the memory for stimulus sequences and finds that bonobos' working memory decays rapidly and they fail to learn the order of two stimuli, while humans solve the same sequence discrimination almost immediately. This indicates that non-human animals lack a memory for stimulus sequences, which may be one reason behind the origin of human culture.
Article
Biology
Michael T. J. Hague, H. Arthur Woods, Brandon S. Cooper
Summary: The widespread belief about a cognitive limit on human group size, known as 'Dunbar's number', is not supported by statistical analysis. Different methods yield wildly different numbers, with enormous confidence intervals making it futile to specify a single number for the cognitive limit.