☆ 4.7 Article

On the equivalence of Hopfield networks and Boltzmann Machines

NEURAL NETWORKS (2012)

Journal

NEURAL NETWORKS

Volume 34, Issue -, Pages 1-9

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.neunet.2012.06.003

Keywords

Statistical mechanics; Hopfield networks; Boltzmann Machines

Categories

Computer Science, Artificial Intelligence Neurosciences

Funding

Italian Ministry for Education and Research (FIRB) [RBFR08EKEV]
La Sapienza Universita di Roma
GNFM (Gruppo Nazionale per la Fisica Matematica)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A specific type of neural networks, the Restricted Boltzmann Machines (RBM), are implemented for classification and feature detection in machine learning. They are characterized by separate layers of visible and hidden units, which are able to learn efficiently a generative model of the observed data. We study a hybrid version of RBMs, in which hidden units are analog and visible units are binary, and we show that thermodynamics of visible units are equivalent to those of a Hopfield network, in which the N visible units are the neurons and the P hidden units are the learned patterns. We apply the method of stochastic stability to derive the thermodynamics of the model, by considering a formal extension of this technique to the case of multiple sets of stored patterns, which may act as a benchmark for the study of correlated sets. Our results imply that simulating the dynamics of a Hopfield network, requiring the update of N neurons and the storage of N (N - 1)/2 synapses, can be accomplished by a hybrid Boltzmann Machine, requiring the update of N P neurons but the storage of only NP synapses. In addition, the well known glass transition of the Hopfield network has a counterpart in the Boltzmann Machine: it corresponds to an optimum criterion for selecting the relative sizes of the hidden and visible layers, resolving the trade-off between flexibility and generality of the model. The low storage phase of the Hopfield model corresponds to few hidden units and hence a overly constrained RBM, while the spin-glass phase (too many hidden units) corresponds to unconstrained RBM prone to overfitting of the observed data. (C) 2012 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

On the effective initialisation for restricted Boltzmann machines via duality with Hopfield model

Francesca Elisa Leonelli, Elena Agliari, Linda Albanese, Adriano Barra

Summary: This study leverages the equivalence between RBMs and HNN to propose an effective weight initialization method and applies it in a simple auto-encoder model. Additionally, obtaining larger retrieval regions by applying Gram-Schmidt orthogonalisation to the patterns is demonstrated.

NEURAL NETWORKS (2021)

Add to Collection

Article Computer Science, Software Engineering

A primer on the application of neural networks to covering array generation

Ludwig Kampel, Michael Wagner, Ilias S. Kotsireas, Dimitris E. Simos

Summary: In this study, neural networks in the form of Boltzmann machines and Hopfield networks are used to construct a specific class of combinatorial designs called covering arrays. By adapting existing algorithms and conducting comprehensive experimental evaluations, the research demonstrates the first application of neural networks in the field of covering array generation and related discrete structures.

OPTIMIZATION METHODS & SOFTWARE (2022)

Add to Collection

Review Physics, Multidisciplinary

Boltzmann Machines as Generalized Hopfield Networks: A Review of Recent Results and Outlooks

Chiara Marullo, Elena Agliari

Summary: The Hopfield model and the Boltzmann machine are popular neural network examples used for classification, feature detection, and generative model learning. They are closely related and can be exactly mapped to each other, representing two sides of the same cognitive process.

ENTROPY (2021)

Add to Collection

Article Computer Science, Artificial Intelligence

Balanced clustering based on collaborative neurodynamic optimization

Xiangguang Dai, Jun Wang, Wei Zhang

Summary: This paper presents a collaborative neurodynamic algorithm for balanced clustering, which solves the combinatorial optimization problem of balanced clustering by using a population of discrete Hopfield networks or Boltzmann machines. Experimental results demonstrate that the proposed algorithm outperforms four existing balanced clustering algorithms in terms of balanced clustering quality.

KNOWLEDGE-BASED SYSTEMS (2022)

Add to Collection

Article Physics, Multidisciplinary

Thermodynamics of bidirectional associative memories

Adriano Barra, Giovanni Catania, Aurelien Decelle, Beatriz Seoane

Summary: In this paper, the equilibrium properties of bidirectional associative memories (BAMs) are investigated. The computational capabilities of a stochastic extension of BAM are characterized using statistical physics techniques. The phase diagram of the model at the replica symmetric level is provided, and the transition curves are analyzed as control parameters are tuned. The retrieval mechanism in BAM is explained by analogy with two interacting Hopfield models, and the potential equivalence with two coupled Restricted Boltzmann Machines is discussed.

JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL (2023)

Add to Collection

Article Physics, Multidisciplinary

Microcanonical ensemble based on the superstatistics with the free Hamiltonian as a stochastic variable

Won Sang Chung, Abdullah Algin

Summary: This work first develops a general procedure for obtaining the superstatistical density of states with nonzero variance in the probability density function. The microcanonical ensemble based on superstatistics with the free Hamiltonian as a stochastic variable is then discussed. Finally, the formalism presented here is applied and analyzed in detail within the framework of different probability distributions, such as three states distribution, the Gamma distribution, the q-deformed Dirac delta distribution, and the Poisson distribution.

EUROPEAN PHYSICAL JOURNAL PLUS (2022)

Add to Collection

Article Materials Science, Multidisciplinary

The mathematics of the ensemble theory

Xiang Gao

Summary: This study reveals that the generalized Boltzmann distribution is mathematically consistent with thermodynamics and challenges the fundamental assumptions of statistical mechanics. It provides a new approach to derive the Boltzmann distribution and could have implications for non-Boltzmann-Gibbs statistical mechanics and philosophical studies on its foundations.

RESULTS IN PHYSICS (2022)

Add to Collection

Article Physics, Multidisciplinary

Neural Network Representation of Tensor Network and Chiral States

Yichen Huang, Joel E. Moore

Summary: We study the representational power of Boltzmann machines in quantum many-body systems and prove that any local tensor network state can be represented by a local neural network. Despite difficulties in representing chiral topological states using local tensor networks, we successfully construct a quasilocal neural network representation for a chiral p-wave superconductor, demonstrating the strength of Boltzmann machines.

PHYSICAL REVIEW LETTERS (2021)

Add to Collection

Article Physics, Fluids & Plasmas

Equilibrium and nonequilibrium description of negative temperature states in a one-dimensional lattice using a wave kinetic approach

M. Onorato, G. Dematteis, D. Proment, A. Pezzi, M. Ballarin, L. Rondoni

Summary: In this study, we predict the presence of negative temperature states in the discrete nonlinear Schodinger (DNLS) equation and provide exact solutions using the associated wave kinetic equation. We define an entropy within the wave kinetic approach that monotonically increases in time and reaches a stationary state in accordance with classical equilibrium statistical mechanics. Our analysis shows that fluctuations of actions at fixed wave numbers relax to their equilibrium behavior faster than the spectrum reaches equilibrium. Numerical simulations of the DNLS equation confirm our theoretical results. The boundedness of the dispersion relation is found to be critical for observing negative temperatures in lattices characterized by two invariants.

PHYSICAL REVIEW E (2022)

Add to Collection

Article Materials Science, Multidisciplinary

Boltzmann machines as two-dimensional tensor networks

Sujie Li, Feng Pan, Pengfei Zhou, Pan Zhang

Summary: Restricted Boltzmann machines (RBMs) and deep Boltzmann machines (DBMs) are important models in machine learning, with recent applications in quantum many-body physics. This study establishes fundamental connections between RBMs and DBMs with tensor networks, and presents an efficient algorithm for computing their partition functions, showing improved accuracy compared to state-of-the-art methods. The research highlights potential applications in training DBMs and estimating the partition function of RBMs.

PHYSICAL REVIEW B (2021)

Add to Collection

Article Physics, Multidisciplinary

Observation of Light Thermalization to Negative-Temperature Rayleigh-Jeans Equilibrium States in Multimode Optical Fibers

K. Baudin, J. Garnier, A. Fusaro, N. Berti, C. Michel, K. Krupa, G. Millot, A. Picozzi

Summary: This paper reports the observation of Rayleigh-Jeans thermalization of light waves to negative-temperature equilibrium states. The optical wave relaxes to the equilibrium state through its propagation in a multimode optical fiber, where high energy levels are more populated than low energy levels. Experimental results show that negative-temperature speckle beams have a nonmonotonic radial intensity profile.

PHYSICAL REVIEW LETTERS (2023)

Add to Collection

Article Physics, Multidisciplinary

Senses along Which the Entropy Sq Is Unique

Constantino Tsallis

Summary: The Boltzmann-Gibbs-von Neumann-Shannon additive entropy and its nonextensive counterpart have provided a foundation for statistical mechanics in both classical and quantum systems. However, the increasing complexity of natural, artificial, and social systems has made it necessary to develop nonadditive entropic functionals. Among them, the nonextensive entropy Sq has played a special role in the study of complex systems.

ENTROPY (2023)

Add to Collection

Article Mathematics, Applied

Hopfield model with planted patterns: A teacher-student self-supervised learning model

Francesco Alemanno, Luca Camanzi, Gianluca Manzan, Daniele Tantari

Summary: While Hopfield networks are widely used for memory storage and retrieval, this study explores the possibility of using Boltzmann machines for self-supervised learning. By generalizing the Hopfield model with structured patterns, the learning performance is analyzed based on the size of the training set, dataset noise, and weight regularization. The results show that with an informative dataset, the machine can learn through memorization, while with a noisy dataset, a critical number of examples is needed for generalization.

APPLIED MATHEMATICS AND COMPUTATION (2023)

Add to Collection

Article Chemistry, Physical

Solving the Schro?dinger Equation in the Configuration Space with Generative Machine Learning

Basile Herzog, Bastien Casier, Sebastin Lebegue, Dario Rocca

Summary: The configuration interaction approach is a powerful method for solving the Schrödinger equation in realistic molecules and materials, but it has a scalability issue that limits its practical use. In this study, we propose a machine learning approach to selectively generate important configurations, which leads to faster convergence to chemical accuracy compared to random sampling or Monte Carlo configuration interaction method. This work opens up new possibilities for using generative models to solve electronic structure problems.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Information geometry of hyperbolic-valued Boltzmann machines

Masaki Kobayashi

Summary: Information geometry is introduced to analyze hyperbolic-valued neural networks, proving that they form an exponential family and providing natural and mixture parameters, determining the Fisher metric, proving the existence of mixed parameters for all distributions, which are useful for learning algorithms.

NEUROCOMPUTING (2021)

Add to Collection

Article Physics, Mathematical

A Mean-Field Monomer-Dimer Model with Randomness: Exact Solution and Rigorous Results

Diego Alberici, Pierluigi Contucci, Emanuele Mingione

JOURNAL OF STATISTICAL PHYSICS (2015)

Add to Collection

Article Social Sciences, Interdisciplinary

Egalitarianism in the rank aggregation problem: a new dimension for democracy

Pierluigi Contucci, Emanuele Panizzi, Federico Ricci-Tersenghi, Alina Sirbu

QUALITY & QUANTITY (2016)

Add to Collection

Article Multidisciplinary Sciences

Enhancing participation to health screening campaigns by group interactions

Raffaella Burioni, Pierluigi Contucci, Micaela Fedele, Cecilia Vernia, Alessandro Vezzani

SCIENTIFIC REPORTS (2015)

Add to Collection

Article Physics, Mathematical

Limit Theorems for Monomer-Dimer Mean-Field Models with Attractive Potential

Diego Alberici, Pierluigi Contucci, Micaela Fedele, Emanuele Mingione

COMMUNICATIONS IN MATHEMATICAL PHYSICS (2016)

Add to Collection

Article Physics, Multidisciplinary

Non-Gaussian fluctuations in monomer-dimer models

Diego Alberici, Pierluigi Contucci, Emanuele Mingione

EPL (2016)

Add to Collection

Article Mathematics, Interdisciplinary Applications

Forecasting the integration of immigrants

Pierluigi Contucci, Rickard Sandell, Seyedalireza Seyedi

JOURNAL OF MATHEMATICAL SOCIOLOGY (2017)

Add to Collection

Article Physics, Multidisciplinary

Inverse problem for the mean-field monomer-dimer model with attractive interaction

Pierluigi Contucci, Rachele Luzi, Cecilia Vernia

JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL (2017)

Add to Collection

Article Physics, Multidisciplinary

Multi-Species Mean Field Spin Glasses. Rigorous Results

Adriano Barra, Pierluigi Contucci, Emanuele Mingione, Daniele Tantari

ANNALES HENRI POINCARE (2015)

Add to Collection

Article Physics, Mathematical

Solution of the Monomer-Dimer Model on Locally Tree-Like Graphs. Rigorous Results

Diego Alberici, Pierluigi Contucci

COMMUNICATIONS IN MATHEMATICAL PHYSICS (2014)

Add to Collection

Article Social Sciences, Interdisciplinary

The lack of probability culture in Italy. Toward an international comparative research program

Pierluigi Contucci, Candia Riga

QUALITY & QUANTITY (2015)

Add to Collection

Article Physics, Mathematical

A Multi-scale Spin-Glass Mean-Field Model

Pierluigi Contucci, Emanuele Mingione

COMMUNICATIONS IN MATHEMATICAL PHYSICS (2019)

Add to Collection

Article Physics, Multidisciplinary

Voter-like Dynamics with Conflicting Preferences on Modular Networks

Filippo Zimmaro, Pierluigi Contucci, Janos Kertesz

Summary: Social coordination and personal preferences are important factors shaping an individual's opinion. The topology of the network of interactions also plays a significant role. This study examines an extension of the voter model, where agents are divided into populations with opposite preferences, using both analytical and simulation methods. The results show that the modular structure of the network increases polarization, and the success of imposing one group's preferred opinion on the other depends on the segregation of the latter population rather than the topological structure of the former. The mean-field approach is compared with the pair approximation, and its predictions are validated on a real network.

ENTROPY (2023)

Add to Collection

Article Physics, Fluids & Plasmas

Inverse problem beyond two-body interaction: The cubic mean-field Ising model

Pierluigi Contucci, Godwin Osabutey, Cecilia Vernia

Summary: In this paper, we solve the inverse problem of the cubic mean-field Ising model by reconstructing the free parameters of the system from configuration data generated according to the model's distribution. We test the robustness of this inversion procedure in both the region of solution uniqueness and the region with multiple thermodynamic phases.

PHYSICAL REVIEW E (2023)

Add to Collection

Article Physics, Multidisciplinary

Aggregation models on hypergraphs

Diego Alberici, Pierluigi Contucci, Emanuele Mingione, Marco Molari

ANNALS OF PHYSICS (2017)

Add to Collection

Article Demography

How integrated are immigrants?

Pierluigi Contucci, Rickard Sandell

DEMOGRAPHIC RESEARCH (2015)

Add to Collection

Article Computer Science, Artificial Intelligence

Reduced-complexity Convolutional Neural Network in the compressed domain

Hamdan Abdellatef, Lina J. Karam

Summary: This paper proposes performing the learning and inference processes in the compressed domain to reduce computational complexity and improve speed of neural networks. Experimental results show that modified ResNet-50 in the compressed domain is 70% faster than traditional spatial-based ResNet-50 while maintaining similar accuracy. Additionally, a preprocessing step with partial encoding is suggested to improve resilience to distortions caused by low-quality encoded images. Training a network with highly compressed data can achieve good classification accuracy with significantly reduced storage requirements.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Theoretical limits on the speed of learning inverse models explain the rate of adaptation in arm reaching tasks

Victor R. Barradas, Yasuharu Koike, Nicolas Schweighofer

Summary: Inverse models are essential for human motor learning as they map desired actions to motor commands. The shape of the error surface and the distribution of targets in a task play a crucial role in determining the speed of learning.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Learning a robust foundation model against clean-label data poisoning attacks at downstream tasks

Ting Zhou, Hanshu Yan, Jingfeng Zhang, Lei Liu, Bo Han

Summary: We propose a defense strategy that reduces the success rate of data poisoning attacks in downstream tasks by pre-training a robust foundation model.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for neural networks

Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

Summary: In this paper, the convergence rate of AdaSAM in the stochastic non-convex setting is analyzed. Theoretical proof shows that AdaSAM has a linear speedup property and decouples the stochastic gradient steps with the adaptive learning rate and perturbed gradient. Experimental results demonstrate that AdaSAM outperforms other optimizers in terms of performance.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Grasping detection of dual manipulators based on Markov decision process with neural network

Juntong Yun, Du Jiang, Li Huang, Bo Tao, Shangchun Liao, Ying Liu, Xin Liu, Gongfa Li, Disi Chen, Baojia Chen

Summary: In this study, a dual manipulator grasping detection model based on the Markov decision process is proposed. By parameterizing the grasping detection model of dual manipulators using a cross entropy convolutional neural network and a full convolutional neural network, stable grasping of complex multiple objects is achieved. Robot grasping experiments were conducted to verify the feasibility and superiority of this method.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Asymmetric double networks mutual teaching for unsupervised person Re-identification

Miaohui Zhang, Kaifang Li, Jianxin Ma, Xile Wang

Summary: This paper proposes an unsupervised person re-identification (Re-ID) method that uses two asymmetric networks to generate pseudo-labels for each other by clustering and updates and optimizes the pseudo-labels through alternate training. It also designs similarity compensation and similarity suppression based on the camera ID of pedestrian images to optimize the similarity measure. Extensive experiments show that the proposed method achieves superior performance compared to state-of-the-art unsupervised person re-identification methods.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Low-variance Forward Gradients using Direct Feedback Alignment and momentum

Florian Bacho, Dominique Chu

Summary: This paper proposes a new approach called the Forward Direct Feedback Alignment algorithm for supervised learning in deep neural networks. By combining activity-perturbed forward gradients, direct feedback alignment, and momentum, this method achieves better performance and convergence speed compared to other local alternatives to backpropagation.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Maximum margin and global criterion based-recursive feature selection

Xiaojian Ding, Yi Li, Shilin Chen

Summary: This research paper addresses the limitations of recursive feature elimination (RFE) and its variants in high-dimensional feature selection tasks. The proposed algorithms, which introduce a novel feature ranking criterion and an optimal feature subset evaluation algorithm, outperform current state-of-the-art methods.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Mental image reconstruction from human brain activity: Neural decoding of mental imagery via deep neural network-based Bayesian estimation

Naoko Koide-Majima, Shinji Nishimoto, Kei Majima

Summary: Visual images observed by humans can be reconstructed from brain activity, and the visualization of arbitrary natural images from mental imagery has been achieved through an improved method. This study provides a unique tool for directly investigating the subjective contents of the brain.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Hierarchical attention network with progressive feature fusion for facial expression recognition

Huanjie Tao, Qianyue Duan

Summary: In this paper, a hierarchical attention network with progressive feature fusion is proposed for facial expression recognition (FER), addressing the challenges posed by pose variation, occlusions, and illumination variation. The model achieves enhanced performance by aggregating diverse features and progressively enhancing discriminative features.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

SLAPP: Subgraph-level attention-based performance prediction for deep learning models

Zhenyi Wang, Pengfei Yang, Linwei Hu, Bowen Zhang, Chengmin Lin, Wenkai Lv, Quan Wang

Summary: In the face of the complex landscape of deep learning, we propose a novel subgraph-level performance prediction method called SLAPP, which combines graph and operator features through an innovative graph neural network called EAGAT, providing accurate performance predictions. In addition, we introduce a mixed loss design with dynamic weight adjustment to improve predictive accuracy.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

LDCNet: Lightweight dynamic convolution network for laparoscopic procedures image segmentation

Yiyang Yin, Shuangling Luo, Jun Zhou, Liang Kang, Calvin Yu-Chian Chen

Summary: Medical image segmentation is crucial for modern healthcare systems, especially in reducing surgical risks and planning treatments. Transanal total mesorectal excision (TaTME) has become an important method for treating colon and rectum cancers. Real-time instance segmentation during TaTME surgeries can assist surgeons in minimizing risks. However, the dynamic variations in TaTME images pose challenges for accurate instance segmentation.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

start-stop points CenterNet for wideband signals detection and time-frequency localization in spectrum sensing

Teng Cheng, Lei Sun, Junning Zhang, Jinling Wang, Zhanyang Wei

Summary: This study proposes a scheme that combines the start-stop point signal features for wideband multi-signal detection, called Fast Spectrum-Size Self-Training network (FSSNet). By utilizing start-stop points to build the signal model, this method successfully solves the difficulty of existing deep learning methods in detecting discontinuous signals and achieves satisfactory detection speed.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Learning deep representation and discriminative features for clustering of multi-layer networks

Wenming Wu, Xiaoke Ma, Quan Wang, Maoguo Gong, Quanxue Gao

Summary: The layer-specific modules in multi-layer networks are critical for understanding the structure and function of the system. However, existing methods fail to accurately characterize and balance the connectivity and specificity of these modules. To address this issue, a joint learning graph clustering algorithm (DRDF) is proposed, which learns the deep representation and discriminative features of the multi-layer network, and balances the connectivity and specificity of the layer-specific modules through joint learning.

NEURAL NETWORKS (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Boundary uncertainty aware network for automated polyp segmentation

Guanghui Yue, Guibin Zhuo, Weiqing Yan, Tianwei Zhou, Chang Tang, Peng Yang, Tianfu Wang

Summary: This paper proposes a novel boundary uncertainty aware network (BUNet) for precise and robust colorectal polyp segmentation. BUNet utilizes a pyramid vision transformer encoder to learn multi-scale features and incorporates a boundary exploration module (BEM) and a boundary uncertainty aware module (BUM) to handle boundary areas. Experimental results demonstrate that BUNet outperforms other methods in terms of performance and generalization ability.

NEURAL NETWORKS (2024)

Add to Collection

© Peeref 2019-2024. All rights reserved.