4.7 Article

BayesFlow: Learning Complex Stochastic Models With Invertible Neural Networks

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2020.3042395

Keywords

Data models; Bayes methods; Biological system modeling; Neural networks; Training; Numerical models; Estimation; Bayesian inference; computational and artificial intelligence; machine learning; neural networks; statistical learning

Funding

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) [GRK 2277]

Ask authors/readers for more resources

This article proposes a globally amortized Bayesian inference method called BayesFlow, based on invertible neural networks. The method learns a global estimator for the probabilistic mapping from observed data to model parameters using simulations. It can infer full posteriors on multiple real data sets without additional training, and incorporates a summary network to embed observed data into informative summary statistics. The utility of BayesFlow is demonstrated on intractable models from various fields.
Estimating the parameters of mathematical models is a common problem in almost all branches of science. However, this problem can prove notably difficult when processes and model descriptions become increasingly complex and an explicit likelihood function is not available. With this work, we propose a novel method for globally amortized Bayesian inference based on invertible neural networks that we call BayesFlow. The method uses simulations to learn a global estimator for the probabilistic mapping from observed data to underlying model parameters. A neural network pretrained in this way can then, without additional training or optimization, infer full posteriors on arbitrarily many real data sets involving the same model family. In addition, our method incorporates a summary network trained to embed the observed data into maximally informative summary statistics. Learning summary statistics from data makes the method applicable to modeling scenarios where standard inference techniques with handcrafted summary statistics fail. We demonstrate the utility of BayesFlow on challenging intractable models from population dynamics, epidemiology, cognitive science, and ecology. We argue that BayesFlow provides a general framework for building amortized Bayesian parameter estimation machines for any forward model from which data can be simulated.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Psychology, Biological

Mental speed is high until age 60 as revealed by analysis of over a million participants

Mischa von Krause, Stefan T. Radev, Andreas Voss

Summary: Response speeds in simple decision-making tasks decline in early and middle adulthood, but this is not solely due to mental speed differences. Using a Bayesian diffusion model and analysis of a large dataset with over one million participants, researchers find that the slowing is mainly caused by increases in decision caution and non-decisional processes. Slowing of mental speed is only observed after the age of approximately 60.

NATURE HUMAN BEHAVIOUR (2022)

Article Psychology, Experimental

Affect dynamics and well-being: explanatory power of the model of intraindividual variability in affect (MIVA)

Maria Wirth, Andreas Voss, Stefan Wirth, Klaus Rothermund

Summary: Research has shown that the current indicators used to capture affect dynamics have limited value in predicting well-being, indicating a need to identify more valid assessment methods.

COGNITION & EMOTION (2022)

Article Physics, Particles & Fields

Inference of cosmic-ray source properties by conditional invertible neural networks

Teresa Bister, Martin Erdmann, Ullrich Koethe, Josina Schulte

Summary: Inference of physical parameters from measured distributions is a crucial task in physics data analysis. Conditional invertible neural networks are an elegant deep learning method that can preserve probability and evaluate posterior distributions. This study compares the performance of conditional invertible neural networks with the traditional method and finds good agreement between the derived physics parameters.

EUROPEAN PHYSICAL JOURNAL C (2022)

Article Psychology

Duration discrimination: A diffusion decision modeling approach

Lukas Schumacher, Andreas Voss

Summary: The human ability to discriminate the duration of stimuli is influenced by the presentation order and the history of previously encountered stimuli. Cognitive models that account for these effects suggest that participants' duration estimation is influenced by an internal reference that evolves throughout the experiment. In this study, a new diffusion model incorporating perceptual discrimination mechanisms accurately predicts performance in a duration discrimination task and is sensitive to different effects.

ATTENTION PERCEPTION & PSYCHOPHYSICS (2023)

Article Astronomy & Astrophysics

Exoplanet characterization using conditional invertible neural networks

Jonas Haldemann, Victor Ksoll, Daniel Walter, Yann Alibert, Ralf S. Klessen, Willy Benz, Ullrich Koethe, Lynton Ardizzone, Carsten Rother

Summary: Researchers propose using conditional invertible neural networks to calculate the posterior probability of planetary structure parameters, which can speed up the inference process for characterizing exoplanets. By training the neural network on a large database of internal structure models, they show that cINNs can infer the composition of an exoplanet much faster than the commonly used MCMC method. However, computing a large database is still required for training the network.

ASTRONOMY & ASTROPHYSICS (2023)

Article Geriatrics & Gerontology

Age Differences in Everyday Emotional Experience: Testing Core Predictions of Socioemotional Selectivity Theory With the MIVA Model

Maria Wirth, Andreas Voss, Klaus Rothermund

Summary: Emotional aging research focuses on age-related improvements in motivation. Socioemotional selectivity theory proposes that as individuals age, they favor emotion-related goals and savor positive but avoid negative emotions. Our computational approach provides partial support for these predictions.

JOURNALS OF GERONTOLOGY SERIES B-PSYCHOLOGICAL SCIENCES AND SOCIAL SCIENCES (2023)

Article Geriatrics & Gerontology

Age-Related Differences in Decision-Making: Evidence Accumulation is More Gradual in Older Age

Eva Marie Wieschen, Aalim Makani, Stefan T. Radev, Andreas Voss, Julia Spaniol

Summary: Older adults exhibit longer response times in various cognitive domains. The diffusion model suggests that this is due to a cautious response style and slower non-decisional processes, rather than a difference in information accumulation rate. The Levy flight model extends the diffusion model by accommodating larger jumps in evidence accumulation and shows that older adults have a more gradual pattern of evidence accumulation compared to younger adults.

EXPERIMENTAL AGING RESEARCH (2023)

Article Mathematics, Interdisciplinary Applications

The Dirichlet Dual Response Model: An Item Response Model for Continuous Bounded Interval Responses

Matthias Kloft, Raphael Hartmann, Andreas Voss, Daniel W. W. Heck

Summary: Standard response formats like rating or visual analogue scales require respondents to condense distributions of latent states or behaviors into a single value, neglecting the variance of distributions. To address this, the dual-range slider is used to measure variability. An extension of the beta response model, the Dirichlet dual response model, is proposed and evaluated for parameter recovery.

PSYCHOMETRIKA (2023)

Article Multidisciplinary Sciences

Neural superstatistics for Bayesian estimation of dynamic cognitive models

Lukas Schumacher, Paul-Christian Buerkner, Andreas Voss, Ullrich Koethe, Stefan T. Radev

Summary: This study proposes a method to add a temporal dimension to cognitive models and estimate their dynamics using a superstatistics perspective. The results show that the deep learning approach is very efficient in capturing the temporal dynamics of the model, and that the erroneous assumption of static or homogeneous parameters will hide important temporal information.

SCIENTIFIC REPORTS (2023)

Article Computer Science, Artificial Intelligence

Amortized Bayesian Model Comparison With Evidential Deep Learning

Stefan T. Radev, Marco D'Alessandro, Ulf K. Mertens, Andreas Voss, Ullrich Koethe, Paul-Christian Buerkner

Summary: Comparing mathematical models of complex processes is important in various scientific fields. The Bayesian probabilistic framework provides a principled way to compare models and extract useful metrics. However, many interesting models cannot be handled by standard Bayesian methods. Therefore, we propose a novel method using specialized deep learning architectures for Bayesian model comparison.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Psychology, Experimental

Is it all about the feeling? Affective and (meta-)cognitive mechanisms underlying the truth effect

Annika Stump, Jan Rummel, Andreas Voss

Summary: The study found that negative emotions can reverse the truth effect, and individuals with higher need for cognitive closure are more susceptible to the truth effect. Additionally, the truth effect diminishes when the repetition interval is long.

PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG (2022)

No Data Available