Article

Caching mechanisms for habit formation in Active Inference

Journal

NEUROCOMPUTING
Volume 359, Pages 298-314

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2019.05.083

Keywords

Deliberative control; Habitual control; Habitisation; Active Inference; Caching

Funding

  1. European Union's Horizon 2020 Framework Programme for Research and Innovation [785907]
  2. Wellcome Trust Principal Research Fellowship [088130/Z/09/Z]

A popular distinction in the human and animal learning literature is between deliberate (or willed) and habitual (or automatic) modes of control. Extensive evidence indicates that, after sufficient learning, living organisms develop behavioural habits that allow them to save computational resources. Furthermore, humans and other animals are able to transfer control from deliberate to habitual modes (and vice versa), efficiently trading off flexibility and parsimony - an ability that is currently unparalleled by artificial control systems. Here, we discuss a computational implementation of habit formation, and the transfer of control from deliberate to habitual modes (and vice versa), within Active Inference: a computational framework that merges aspects of cybernetic theory and of Bayesian inference. To model habit formation, we endow an Active Inference agent with a mechanism to cache (or memorize) policy probabilities from previous trials, and reuse them to skip - in part or in full - the inferential steps of deliberative processing. We exploit the fact that the relative quality of policies, conditioned upon hidden states, is constant over trials, provided that contingencies and prior preferences do not change. This means the only quantity that can change policy selection is the prior distribution over the initial state - where this prior is based upon the posterior beliefs from previous trials. Thus, an agent that caches the quality (or the probability) of policies can safely reuse cached values to save on cognitive and computational resources unless contingencies change. Our simulations illustrate the computational benefits, but also the limits, of three caching schemes under Active Inference. They suggest that key aspects of habitual behaviour - such as perseveration - can be explained in terms of caching policy probabilities.
Furthermore, they suggest that there may be many kinds (or stages) of habitual behaviour, each associated with a different caching scheme; for example, caching associated or not associated with contextual estimation. These schemes are more or less impervious to contextual and contingency changes. © 2019 The Author(s). Published by Elsevier B.V.
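
The caching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce any of the three caching schemes in the paper. It assumes only what the abstract states: policy probabilities conditioned on the hidden state stay fixed while contingencies and preferences are unchanged, so they can be stored and reweighted by the current prior over the initial state. The names `PolicyCache`, `toy_evaluate`, and `invalidate` are hypothetical, and `toy_evaluate` is a stand-in for full deliberative inference, not an Active Inference computation.

```python
import math
from typing import Callable, Dict, List


class PolicyCache:
    """Hypothetical cache of policy probabilities conditioned on the initial state."""

    def __init__(self, evaluate: Callable[[int], List[float]]) -> None:
        # `evaluate` stands in for costly deliberative inference (an assumption).
        self._evaluate = evaluate
        self._cache: Dict[int, List[float]] = {}
        self.deliberations = 0  # how many times deliberation actually ran

    def conditional(self, state: int) -> List[float]:
        # Deliberate only the first time a state is encountered.
        if state not in self._cache:
            self._cache[state] = self._evaluate(state)
            self.deliberations += 1
        return self._cache[state]

    def policy_probabilities(self, prior: List[float]) -> List[float]:
        # Marginalise cached conditionals over the prior on the initial state:
        # P(policy) = sum_s P(policy | s) * P(s).
        mixed: List[float] = []
        for state, weight in enumerate(prior):
            if weight <= 0.0:
                continue  # states with no prior mass need no deliberation
            probs = self.conditional(state)
            if not mixed:
                mixed = [0.0] * len(probs)
            for i, p in enumerate(probs):
                mixed[i] += weight * p
        if not mixed:
            raise ValueError("prior assigns no probability to any state")
        return mixed

    def invalidate(self) -> None:
        # Cached values are only safe while contingencies are unchanged.
        self._cache.clear()


def toy_evaluate(state: int) -> List[float]:
    # Made-up policy qualities for two states and two policies, passed
    # through a softmax. Purely illustrative numbers.
    quality = [[2.0, 0.0], [0.0, 2.0]][state]
    weights = [math.exp(q) for q in quality]
    total = sum(weights)
    return [w / total for w in weights]


cache = PolicyCache(toy_evaluate)
first = cache.policy_probabilities([1.0, 0.0])   # deliberates for state 0
second = cache.policy_probabilities([1.0, 0.0])  # reuses the cached value
```

In this sketch the second call skips deliberation entirely. A cache that is never invalidated keeps returning the old probabilities after contingencies change, which is one simple way to read the perseveration the abstract mentions.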

Recommended

Article Psychiatry

Disorganization of Semantic Brain Networks in Schizophrenia Revealed by fMRI

Yukiko Matsumoto, Satoshi Nishida, Ryusuke Hayashi, Shuraku Son, Akio Murakami, Naganobu Yoshikawa, Hiroyoshi Ito, Naoya Oishi, Naoki Masuda, Toshiya Murai, Karl Friston, Shinji Nishimoto, Hidehiko Takahashi

Summary: This study used functional magnetic resonance imaging (fMRI) to evaluate the large-scale network structures of concept representations in patients with schizophrenia and found that their semantic networks exhibited differences and were associated with thought disorders. This provides pathophysiological evidence for the loosening of associations in schizophrenia.

SCHIZOPHRENIA BULLETIN (2023)

Article Computer Science, Artificial Intelligence

Small steps for mankind: Modeling the emergence of cumulative culture from joint active inference communication

Natalie Kastel, Casper Hesp, K. Richard Ridderinkhof, Karl J. Friston

Summary: This paper proposes a testable deep active inference formulation of social behavior and conducts simulations of cumulative culture. By considering cultural transmission as a bi-directional process of communication and social exchange as a process of active inference, the study discovers that cumulative culture emerges from belief updating through a joint minimization of uncertainty.

FRONTIERS IN NEUROROBOTICS (2023)

Article Public, Environmental & Occupational Health

Using a Dynamic Causal Model to validate previous predictions and offer a 12-month forecast of the long-term effects of the COVID-19 epidemic in the UK

Cam Bowie, Karl Friston

Summary: This study analyzed the COVID-19 epidemic in the past 12 months and made predictions for the next year based on this analysis. It found that changes in transmissibility and public behavior led to an underestimation of the severity of the epidemic in previous predictions. The projections indicate that the number of infections in the coming year will be three times larger than last year, leading to more deaths and economic consequences.

FRONTIERS IN PUBLIC HEALTH (2023)

Review Neurosciences

Generative models for sequential dynamics in active inference

Thomas Parr, Karl Friston, Giovanni Pezzulo

Summary: A central theme of theoretical neurobiology is that most cognitive operations require processing of discrete sequences, and this processing is driven by continuous neuronal dynamics. From the perspective of active inference, we explore sequential brain processing by assuming a generative model of the sensed world. This model includes central pattern generators, narratives, or well-defined sequences, and can account for various aspects of motor control, perception, planning, and understanding.

COGNITIVE NEURODYNAMICS (2023)

Article Computer Science, Artificial Intelligence

Reward Maximization Through Discrete Active Inference

Lancelot Da Costa, Noor Sajid, Thomas Parr, Karl Friston, Ryan Smith

Summary: Active inference is a probabilistic framework based on the principle of minimizing free energy, used for modeling the behavior of biological and artificial agents. It has been successfully applied to various situations involving reward maximization, often yielding comparable or superior results to alternative approaches. This article explores the connection between reward maximization and active inference and demonstrates the conditions under which active inference produces the optimal solution to the Bellman equation, a fundamental equation in reinforcement learning and control. Additionally, it introduces a new recursive active inference scheme that can produce Bellman optimal actions on any finite temporal horizon.

NEURAL COMPUTATION (2023)

Letter Neurosciences

Scientific communication and the semantics of sentience

Brett J. Kagan, Adeel Razi, Anjali Bhat, Andy C. Kitchen, Nhi T. Tran, Forough Habibollahi, Moein Khajehnejad, Bradyn J. Parker, Ben Rollo, Karl J. Friston

NEURON (2023)

Article Neurosciences

Canalization and plasticity in psychopathology

R. L. Carhart-Harris, S. Chandaria, D. E. Erritzoe, A. Gazzaley, M. Girn, H. Kettner, P. A. M. Mediano, D. J. Nutt, F. E. Rosa, L. Roseman, C. Timmermann, B. Weiss, R. J. Zeifman, K. J. Friston

Summary: This theoretical article proposes a new model of a general factor of psychopathology, using the concept of 'canalization'. It distinguishes between two types of plasticity: 'TEMP' and 'canalization', which can be differentiated by their relationship to precision or inverse variance. The authors argue that 'pathological' phenotypes develop through mechanisms of canalization and increased model precision, as a response to adversity and distress. They suggest that TEMP, along with psychological support, can counter canalization and offer experiments and measures to test their hypotheses.

NEUROPHARMACOLOGY (2023)

Correction Computer Science, Artificial Intelligence

Small steps for mankind: Modeling the emergence of cumulative culture from joint active inference communication (vol 16, 944986, 2023)

Natalie Kastel, Casper Hesp, K. Richard Ridderinkhof, Karl J. Friston

FRONTIERS IN NEUROROBOTICS (2023)

Article Pediatrics

Therapeutic touch and therapeutic alliance in pediatric care and neonatology: An active inference framework

Zoe McParlin, Francesco Cerritelli, Andrea Manzotti, Karl J. Friston, Jorge E. Esteves

Summary: Therapeutic affective touch is crucial for survival, nurturing supportive interactions, and promoting overall health. This paper presents an integrative model that combines therapeutic touch and communication to achieve biobehavioural synchrony. It explains the neurophysiological and behavioural mechanisms of developing synchronous relationships through touch and emphasizes the importance of therapeutic touch in building a solid therapeutic alliance.

FRONTIERS IN PEDIATRICS (2023)

Article Neurosciences

Attentional effects on local V1 microcircuits explain selective V1-V4 communication

Christini Katsanevaki, Andre M. Bastos, Hayriye Cagnan, Conrado A. Bosman, Karl J. Friston, Pascal Fries

Summary: Selective attention enhances the influence of specific synaptic inputs on higher-area neurons, enabling preferential routing of attended stimuli. Presynaptic circuits, influenced by top-down attentional signals, play a crucial role in selective routing by selectively entraining postsynaptic neurons. The study demonstrates that attentional modulation of intrinsic connections in the visual cortex mediates selective entrainment, providing an explanation for the observed phenomenon.

NEUROIMAGE (2023)

Review Biology

Beyond simple laboratory studies: Developing sophisticated models to study rich behavior

Antonella Maselli, Jeremy Gordon, Mattia Eluchans, Gian Luca Lancia, Thomas Thiery, Riccardo Moretti, Paul Cisek, Giovanni Pezzulo

Summary: Psychology and neuroscience should adopt innovative experimental designs, measurement methods, analysis techniques, and computational models to study rich, ecologically valid forms of behavior. Studying restricted behaviors in laboratory settings risks missing key aspects of cognitive and neural functions. This article highlights the challenges and opportunities of studying rich forms of behavior and emphasizes the importance of developing sophisticated formal models to understand cognitive and neural processes.

PHYSICS OF LIFE REVIEWS (2023)

Article Psychology, Experimental

Relative fluency (unfelt vs felt) in active inference

Denis Brouillet, Karl Friston

Summary: The brain is known to be a predictive organ that predicts sensory content and the accuracy of its predictions. It must infer the reliability of its own beliefs in order to predict the precision of its predictions. This recognition process leads to the concept of "fluency", which is the perception of having a precise understanding of sensory processes. Changes in fluency, from unfelt to felt, are recognized and realized when updating predictions about accuracy.

CONSCIOUSNESS AND COGNITION (2023)

Article Psychology

Risky decisions are influenced by individual attributes as a function of risk preference

Douglas G. Lee, Marco D'Alessandro, Pierpaolo Iodice, Cinzia Calluso, Aldo Rustichini, Giovanni Pezzulo

Summary: This study provides evidence for the framework where information about individual attributes independently impacts decision-making. The results suggest that risky decisions are resolved by running parallel comparisons between separate attributes, contradicting the assumption of standard economic theory. The study also reveals that attribute salience affects risk preference types differently.

COGNITIVE PSYCHOLOGY (2023)

Article Automation & Control Systems

Interactive Inference: A Multi-Agent Model of Cooperative Joint Actions

Domenico Maisto, Francesco Donnarumma, Giovanni Pezzulo

Summary: This research proposes a computational model based on active inference for multi-agent cooperative joint actions. The model utilizes interactive inference to probabilistically infer the joint goal and updates beliefs and strategies through observation of each other's movements. The results of simulations demonstrate that interactive inference supports successful multi-agent joint actions and replicates key dynamics observed in human-human experiments.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Review Biology

Path integrals, particular kinds, and strange things

Karl Friston, Lancelot Da Costa, Dalton A. R. Sakthivadivel, Conor Heins, Grigorios A. Pavliotis, Maxwell Ramstead, Thomas Parr

Summary: This paper introduces a path integral formulation of the free energy principle to describe the trajectories of particles over time. By employing the principle of least action, it is possible to simulate the behavior of particles in exchange with their external environment. The paper discusses various types of particles and their different levels of inference or sentience.

PHYSICS OF LIFE REVIEWS (2023)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C++ implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model, S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on the China A-Shares market and the NASDAQ market demonstrate the effectiveness of the model in stock market trends prediction, and its effectiveness is further validated in real-world applications at a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)