4.7 Article

Variational autoencoder: An unsupervised model for encoding and decoding fMRI activity in visual cortex

期刊

NEUROIMAGE
卷 198, 期 -, 页码 125-136

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.neuroimage.2019.05.039

关键词

Neural coding; Variational autoencoder; Bayesian brain; Visual reconstruction

资金

  1. NIH at Purdue University [R01MH104402]
  2. College of Engineering at Purdue University

向作者/读者索取更多资源

Goal-driven and feedforward-only convolutional neural networks (CNN) have been shown to be able to predict and decode cortical responses to natural images or videos. Here, we explored an alternative deep neural network, variational auto-encoder (VAE), as a computational model of the visual cortex. We trained a VAE with a five-layer encoder and a five-layer decoder to learn visual representations from a diverse set of unlabeled images. Using the trained VAE, we predicted and decoded cortical activity observed with functional magnetic resonance imaging (fMRI) from three human subjects passively watching natural videos. Compared to CNN, VAE could predict the video-evoked cortical responses with comparable accuracy in early visual areas, but relatively lower accuracy in higher-order visual areas. The distinction between CNN and VAE in terms of encoding performance was primarily attributed to their different learning objectives, rather than their different model architecture or number of parameters. Despite lower encoding accuracies, VAE offered a more convenient strategy for decoding the fMRI activity to reconstruct the video input, by first converting the fMRI activity to the VAE's latent variables, and then converting the latent variables to the reconstructed video frames through the VAE's decoder. This strategy was more advantageous than alternative decoding methods, e.g. partial least squares regression, for being able to reconstruct both the spatial structure and color of the visual input. Such findings highlight VAE as an unsupervised model for learning visual representation, as well as its potential and limitations for explaining cortical responses and reconstructing naturalistic and diverse visual experiences.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Neurosciences

Deep recurrent neural network reveals a hierarchy of process memory during dynamic natural vision

Junxing Shi, Haiguang Wen, Yizhen Zhang, Kuan Han, Zhongming Liu

HUMAN BRAIN MAPPING (2018)

Article Neurosciences

Task-evoked functional connectivity does not explain functional connectivity differences between rest and task conditions

Lauren K. Lynch, Kun-Han Lu, Haiguang Wen, Yizhen Zhang, Andrew J. Saykin, Zhongming Liu

HUMAN BRAIN MAPPING (2018)

Article Multidisciplinary Sciences

Musical Imagery Involves Wernicke's Area in Bilateral and Anti-Correlated Network Interactions in Musicians

Yizhen Zhang, Gang Chen, Haiguang Wen, Kun-Han Lu, Zhongming Liu

SCIENTIFIC REPORTS (2017)

Article Multidisciplinary Sciences

Connecting concepts in the brain by mapping cortical representations of semantic relations

Yizhen Zhang, Kuan Han, Robert Worth, Zhongming Liu

NATURE COMMUNICATIONS (2020)

Article Neurosciences

Representation learning of resting state fMRI with variational autoencoder

Jung-Hoon Kim, Yizhen Zhang, Kuan Han, Zheyu Wen, Minkyu Choi, Zhongming Liu

Summary: By training a variational autoencoder with rsfMRI data, researchers have been able to untangle the underlying sources of brain cortical activity and connectivity, representing spatiotemporal characteristics and driving changes in cortical networks. The resultant latent variables can be used as a reliable feature for accurate subject identification, even with limited data available. This demonstrates the value of VAE for unsupervised representation learning in resting state fMRI activity.

NEUROIMAGE (2021)

Article Engineering, Biomedical

Naturalistic stimuli: A paradigm for multiscale functional characterization of the human brain

Yizhen Zhang, Jung-Hoon Kim, David Brang, Zhongming Liu

Summary: Movies, audio stories, and virtual reality are increasingly used as stimuli for functional brain imaging, setting up more ecologically relevant conditions and inducing highly reproducible brain responses as compared to traditional experimental reductionism.

CURRENT OPINION IN BIOMEDICAL ENGINEERING (2021)

Article Neurosciences

Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision

Haiguang Wen, Junxing Shi, Yizhen Zhang, Kun-Han Lu, Jiayue Cao, Zhongming Liu

CEREBRAL CORTEX (2018)

Article Neurosciences

Solving large-scale MEG/EEG source localisation and functional connectivity problems simultaneously using state-space models

Jose Sanchez-Bornot, Roberto C. Sotero, J. A. Scott Kelso, Ozguer Simsek, Damien Coyle

Summary: This study proposes a multi-penalized state-space model for analyzing unobserved dynamics, using a data-driven regularization method. Novel algorithms are developed to solve the model, and a cross-validation method is introduced to evaluate regularization parameters. The effectiveness of this method is validated through simulations and real data analysis, enabling a more accurate exploration of cognitive brain functions.

NEUROIMAGE (2024)