4.3 Article

Nonlinear machine learning in simulations of soft and biological materials

Journal

MOLECULAR SIMULATION
Volume 44, Issue 13-14, Pages 1090-1107

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/08927022.2017.1400164

Keywords

Enhanced sampling; free energy landscapes; non-linear manifold learning; protein folding; self-assembly

Funding

  1. National Science Foundation [DMS-1714212]

Ask authors/readers for more resources

Interpretable parameterisations of free energy landscapes for soft and biological materials calculated from molecular simulation require the availability of good' collective variables (CVs) capable of discriminating the metastable states of the system and the barriers between them. If these CVs are coincident with the slow collective modes governing the long-time dynamical evolution, then they also furnish good coordinates in which to perform enhanced sampling to surmount high free energy barriers and efficiently explore and recover the landscape. Non-linear manifold learning techniques provide a means to systematically extract such CVs from molecular simulation trajectories by identifying and extracting low-dimensional manifolds lying latent within the high-dimensional coordinate space. We survey recent advances in data-driven CV discovery and enhanced sampling using non-linear manifold learning, describe the mathematical and theoretical underpinnings of these techniques, and present illustrative examples to molecular folding and colloidal self-assembly. We close with our outlook and perspective on future advances in this rapidly evolving field.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Chemistry, Physical

Learned Reconstruction of Protein Folding Trajectories from Noisy Single-Molecule Time Series

Maximilian Topel, Ayesha Ejaz, Allison Squires, Andrew L. Ferguson

Summary: Single-molecule Fo''rster resonance energy transfer (smFRET) is used to track the real-time dynamics of molecules. The Takens' Delay Embedding Theorem guarantees that the atomistic dynamics of a system can be represented by a time-delayed embedding of scalar observables. A method called Single-molecule TAkens Reconstruction (STAR) is used to learn the transformation between atomic coordinates and delay-embedded distances accessible to smFRET. STAR has been applied to reconstruct molecular configurations with high accuracy. In this work, the role of signal-to-noise ratio, data volume, and time resolution in simulated smFRET data is investigated to assess the performance of STAR under experimental conditions.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2023)

Editorial Material Chemistry, Physical

Characteristics of Impactful Machine Learning Contributions to The Journal of Physical Chemistry

Andrew L. Ferguson, Jim Pfaendtner

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Editorial Material Chemistry, Physical

Characteristics of Impactful Machine Learning Contributions to The Journal of Physical Chemistry

Andrew L. Ferguson, Jim Pfaendtner

JOURNAL OF PHYSICAL CHEMISTRY B (2023)

Editorial Material Chemistry, Physical

Characteristics of Impactful Machine Learning Contributions to The Journal of Physical Chemistry

Andrew L. Ferguson, Jim Pfaendtner

JOURNAL OF PHYSICAL CHEMISTRY C (2023)

Article Chemistry, Physical

Computational design of self-assembling peptide chassis materials for synthetic cells

Yutao Ma, Rohan Kapoor, Bineet Sharma, Allen P. Liu, Andrew L. Ferguson

Summary: In this work, an active learning screening method is constructed to identify peptide vesicle sequences with high thermodynamic stabilities. These peptide vesicles are more robust and stable compared to traditional lipid vesicles.

MOLECULAR SYSTEMS DESIGN & ENGINEERING (2023)

Article Biochemistry & Molecular Biology

Computational and Experimental Determination of the Properties, Structure, and Stability of Peptoid Nanosheets and Nanotubes

Mingfei Zhao, Shuai Zhang, Renyu Zheng, Sarah Alamdari, Christopher J. Mundy, Jim Pfaendtner, Lilo D. Pozzo, Chun-Long Chen, James J. De Yoreo, Andrew L. Ferguson

Summary: In this study, the mechanical properties of amphiphilic diblock peptoids were investigated using molecular dynamics simulations and atomic force microscopy. The computational predictions were found to be in good agreement with experimental measurements. Additionally, a theoretical model was developed to analyze the stability of nanotubes.

BIOMACROMOLECULES (2023)

Editorial Material Chemistry, Physical

Virtual Special Issue on Machine Learning in Physical Chemistry, Volume 2-Call for Papers

Andrew L. Ferguson, Jim Pfaendtner

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Article Chemistry, Physical

Molecular Latent Space Simulators for Distributed and Multimolecular Trajectories

Michael S. S. Jones, Zachary A. A. McDargh, Rafal P. P. Wiewiora, Jesus A. A. Izaguirre, Huafeng Xu, Andrew L. L. Ferguson

Summary: All atom molecular dynamics (MD) simulations are limited by short time steps, but Markov state modeling (MSM) and latent space simulators (LSS) offer alternative approaches to extend the time scales and improve sampling of rare events and metastable states. In this work, LSS models are developed for both single-molecule and multi-molecule systems, generating long continuous trajectories with reduced computational cost. These trajectories provide valuable insights for therapeutic design and optimization of molecules and improve precision in studying DNA folding dynamics.

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Article Chemistry, Physical

Girsanov Reweighting Enhanced Sampling Technique (GREST): On- the-Fly Data-Driven Discovery of and Enhanced Sampling in Slow Collective Variables

Kirill Shmilovich, Andrew L. Ferguson

Summary: The efficacy of collective variable (CV) enhanced sampling techniques relies on the selection of good CVs correlated with the long-time dynamical behavior of the system. In this work, we introduce Girsanov Reweighting Enhanced Sampling Technique (GREST) as an adaptive sampling scheme that discovers slow CVs and drives sampling of the system. GREST is demonstrated on various systems and provides a publicly available Python package.

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Editorial Material Chemistry, Physical

Virtual Special Issue on Machine Learning in Physical Chemistry, Volume 2-Call for Papers

Andrew L. Ferguson, Jim Pfaendtner

JOURNAL OF PHYSICAL CHEMISTRY B (2023)

Article Multidisciplinary Sciences

Disruption of energetic and dynamic base pairing cooperativity in DNA duplexes by an abasic site

Brennan Ashwood, Michael S. Jones, Andrew L. Ferguson, Andrei Tokmakoff

Summary: DNA duplex stability is affected by cooperative interactions between adjacent nucleotides and is destabilized by abasic site, which splits the cooperativity in a short duplex into two segments and introduces additional barriers to hybridization. This study used temperature-jump infrared spectroscopy and molecular dynamics simulations to investigate the destabilization mechanism and hybridization dynamics caused by an abasic site in small DNA duplexes.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2023)

Article Chemistry, Multidisciplinary

Denoising Autoencoder Trained on Simulation-Derived Structures for Noise Reduction in Chromatin Scanning Transmission Electron Microscopy

Walter Alvarado, Vasundhara Agrawal, Wing Shun Li, Vinayak P. P. Dravid, Vadim Backman, Juan J. J. de Pablo, Andrew L. L. Ferguson

Summary: We developed a denoising autoencoder (DAE) that outperforms other algorithms in providing high-resolution STEM images of nucleosomes and organized domains within chromatin dense regions. The DAE removes noise commonly found in high-angle annular dark field (HAADF) STEM experiments and learns structural features driven by chromatin folding physics. It allows for the resolution of alpha-tetrahedron tetranucleosome motifs that mediate DNA accessibility and does not find evidence for the suggested 30 nm fiber as the higher-order structure of chromatin fiber.

ACS CENTRAL SCIENCE (2023)

Article Chemistry, Physical

Thermodynamic Basis for the Stabilization of Helical Peptoids by Chiral Sidechains

Sarah Alamdari, Kaylyn Torkelson, Xiaoqian Wang, Chun-Long Chen, Andrew L. Ferguson, Jim Pfaendtner

Summary: In this study, a simulation method is used to investigate the complex folding structures of various 12-mer polypeptoids, resulting in a predictive model that links sidechain chemistry with secondary structure formation. The results show that the assembly of Nrpe and Nspe sequences into polyproline type-I helices in water is enthalpically driven, with minor entropic gains from isomerization and steric strain due to the presence of the chiral center. The overall assembly into a helix is found to be entropically unfavorable, highlighting the importance of considering competing interactions in the rational design of peptoid secondary structure building blocks.

JOURNAL OF PHYSICAL CHEMISTRY B (2023)

Article Biochemical Research Methods

ProtWave-VAE: Integrating Autoregressive Sampling with Latent-Based Inference for Data-Driven Protein Design

Niksa Praljak, Xinran Lian, Rama Ranganathan, Andrew L. Ferguson

Summary: In this work, a novel and lightweight deep generative model called ProtWave-VAE is proposed, which combines the strengths of variational autoencoders (VAEs) and autoregressive (AR) models. The model can be trained on unaligned protein sequence data and generate variable length protein sequences. Experimental results demonstrate that the model can infer meaningful functional and phylogenetic relationships, and achieve high accuracy in protein design predictions.

ACS SYNTHETIC BIOLOGY (2023)

Article Chemistry, Multidisciplinary

Data-driven discovery of innate immunomodulators via machine learning-guided high throughput screening

Yifeng Tang, Jeremiah Y. Kim, Carman K. M. Ip, Azadeh Bahmani, Qing Chen, Matthew G. Rosenberger, Aaron P. Esser-Kahn, Andrew L. Ferguson

Summary: A machine learning-enabled active learning pipeline was developed to guide the screening and discovery of small molecule immunomodulators that can improve immune responses. By using high throughput screening and data-driven predictive models, novel small molecules with enhanced or suppressed innate immune signaling capacity were discovered, and chemical design rules were extracted.

CHEMICAL SCIENCE (2023)

Article Chemistry, Physical

Thermodynamic properties of fluids with Mie n - m potentials and application to tune effective Mie potentials for simple real fluids

B. P. Akhouri, R. Perween, J. R. Solana

Summary: Monte Carlo simulations are used to obtain the equation of state and internal energy of fluids with Mie n-m potentials, and its performance is tested against a third order perturbation theory. The theory is then applied to tune the potentials for real fluids and achieve accurate fit with experimental data.

MOLECULAR SIMULATION (2024)

Article Chemistry, Physical

pH-Dependent conformational stability of SpeB from Thermus thermophilus HB8: insights from molecular dynamics simulation

Malaisamy Veerapandian, Nagarajan Hemavathy, Alagesan Karthika, Jayaraman Manikandan, Umashankar Vetrivel, Jeyaraman Jeyakanthan

Summary: This study investigates the conformational stability and flexibility of SpeB enzyme and its interactions with substrate. The research finds that neutral pH 7 and alkaline pH 11 are the optimal conditions for stable binding between SpeB and substrate.

MOLECULAR SIMULATION (2024)

Article Chemistry, Physical

ReaxFF study of the decarboxylation of methyl palmitate over binary metallic nickel-molybdenum catalysts

Maipelo Nyepetsi, Olayinka A. Oyetunji, Foster Mbaiwa

Summary: Biodiesel, a potential alternative to fossil-based fuels, has limitations such as high viscosity, pour point, and cloud point. This study used ReaxFF molecular dynamics to investigate the decarboxylation of methyl palmitate using different catalysts. The presence of alpha-NiMoO4 and beta-NiMoO4 accelerated the reactions and resulted in higher quantities of stable products. Ni3Mo catalyst showed an initial rapid formation of products followed by a decrease. All reactions followed first-order kinetics, and the catalysts reduced the activation energies.

MOLECULAR SIMULATION (2024)