4.7 Article

Markov state models from short non-equilibrium simulations-Analysis and correction of estimation bias

期刊

JOURNAL OF CHEMICAL PHYSICS
卷 146, 期 9, 页码 -

出版社

AMER INST PHYSICS
DOI: 10.1063/1.4976518

关键词

-

资金

  1. Deutsche Forschungsgemeinschaft [SFB 958, SFB 1114]
  2. European Commission through ERC starting grant pcCell
  3. National Science Foundation [CHE-1265929]
  4. Welch Foundation [C-1570]
  5. Direct For Mathematical & Physical Scien
  6. Division Of Chemistry [1265929] Funding Source: National Science Foundation
  7. Direct For Mathematical & Physical Scien
  8. Division Of Physics [1427654] Funding Source: National Science Foundation

向作者/读者索取更多资源

Many state-of-the-art methods for the thermodynamic and kinetic characterization of large and complex biomolecular systems by simulation rely on ensemble approaches, where data from large numbers of relatively short trajectories are integrated. In this context, Markov state models (MSMs) are extremely popular because they can be used to compute stationary quantities and long-time kinetics from ensembles of short simulations, provided that these short simulations are in local equilibrium within the MSM states. However, over the last 15 years since the inception of MSMs, it has been controversially discussed and not yet been answered how deviations from local equilibrium can be detected, whether these deviations induce a practical bias in MSM estimation, and how to correct for them. In this paper, we address these issues: We systematically analyze the estimation of MSMs from short non-equilibrium simulations, and we provide an expression for the error between unbiased transition probabilities and the expected estimate from many short simulations. We show that the unbiased MSM estimate can be obtained even from relatively short non-equilibrium simulations in the limit of long lag times and good discretization. Further, we exploit observable operator model (OOM) theory to derive an unbiased estimator for the MSM transition matrix that corrects for the effect of starting out of equilibrium, even when short lag times are used. Finally, we show how the OOM framework can be used to estimate the exact eigenvalues or relaxation time scales of the system without estimating an MSM transition matrix, which allows us to practically assess the discretization quality of the MSM. Applications to model systems and molecular dynamics simulation data of alanine dipeptide are included for illustration. The improved MSM estimator is implemented in PyEMMA of version 2.3. Published by AIP Publishing.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Chemistry, Physical

Neural mode jump Monte Carlo

Luigi Sbailo, Manuel Dibak, Frank Noe

Summary: The proposed method uses generative neural networks to connect metastable regions directly, propose new configurations in the Markov chain, and optimize the acceptance probability of large jumps between modes in the configuration space. It effectively increases the convergence speed of systems with multiple metastable states.

JOURNAL OF CHEMICAL PHYSICS (2021)

Article Mathematics, Applied

tgEDMD: Approximation of the Kolmogorov Operator in Tensor Train Format

Marvin Luecke, Feliks Nueske

Summary: This study focuses on extracting information about dynamical systems from simulation data through modeling the Koopman operator semigroup. Recent work has been centered on deriving data-efficient representations of the Koopman operator in low-rank tensor formats and applying this to approximate the generator. The method presents consistency and complexity analysis, extensions to practical settings, and demonstrations of its applicability to benchmark numerical examples.

JOURNAL OF NONLINEAR SCIENCE (2022)

Article Physics, Multidisciplinary

Koopman analysis of quantum systems*

Stefan Klus, Feliks Nueske, Sebastian Peitz

Summary: Koopman operator theory has wide applications in various research areas and this paper demonstrates the use of data-driven methods to analyze quantum physics problems and solve the Schrödinger equation, opening up a new avenue for research.

JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL (2022)

Article Multidisciplinary Sciences

A litmus test for classifying recognition mechanisms of transiently binding proteins

Kalyan S. Chakrabarti, Simon Olsson, Supriya Pratihar, Karin Giller, Kerstin Overkamp, Ko On Lee, Vytautas Gapsys, Kyoung-Seok Ryu, Bert L. de Groot, Frank Noe, Stefan Becker, Donghan Lee, Thomas R. Weikl, Christian Griesinger

Summary: The study presents a theoretical and experimental framework to investigate protein binding mechanisms on sub-millisecond timescales. Using nuclear magnetic resonance and molecular dynamics simulations, the authors find that the binding mechanism between ubiquitin and the SH3 domain is based on conformational selection.

NATURE COMMUNICATIONS (2022)

Article Chemistry, Physical

Quantum dynamics using path integral coarse-graining

Felix Musil, Iryna Zaporozhets, Frank Noe, Cecilia Clementi, Venkat Kapil

Summary: This research develops a method for accurately calculating vibrational spectra of molecular systems using a reduced computational cost path-integral formulation. By leveraging advances in machine-learned coarse-graining and a simple temperature elevation scheme, significant computational savings and improved accuracy are achieved compared to more expensive reference approaches. This method has the potential for routine calculations of vibrational spectra for a wide range of molecular systems with an explicit treatment of the quantum nature of nuclei.

JOURNAL OF CHEMICAL PHYSICS (2022)

Article Biochemistry & Molecular Biology

Markov field models: Scaling molecular kinetics approaches to large molecular machines

Tim Hempel, Simon Olsson, Frank Noe

Summary: With recent advances in structural biology, scalable molecular dynamics methods are required for large biomolecular systems. Current approaches focus on global state modeling, but are not applicable to large-scale systems. To address this, we propose using a set of coupled models to describe the local structure of molecular systems. Markov field models, including various models, are evaluated for their use in computational molecular biology.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2022)

Article Mathematics, Applied

Finite-Data Error Bounds for Koopman-Based Prediction and Control

Feliks Nueske, Sebastian Peitz, Friedrich Philipp, Manuel Schaller, Karl Worthmann

Summary: In this paper, probabilistic bounds for the approximation error and the prediction error are derived for dynamical (control) systems using the Koopman operator. The analysis is extended to (stochastic) nonlinear control-affine systems. A previously proposed approach is proven to be effective in avoiding the curse of dimensionality. The effectiveness of the approach is demonstrated through comparisons with state-of-the-art techniques.

JOURNAL OF NONLINEAR SCIENCE (2023)

Article Multidisciplinary Sciences

Deep learning to decompose macromolecules into independent Markovian domains

Andreas Mardt, Tim Hempel, Cecilia Clementi, Frank Noe

Summary: This study addresses the challenge of modeling the dynamics of large molecular systems by introducing a method that simultaneously decomposes and models the system, providing an effective summary of the complex dynamics. While the issue of learning the dynamical coupling between subsystems still remains, it is a significant step towards learning Ising models of large molecular complexes from simulation data.

NATURE COMMUNICATIONS (2022)

Article Chemistry, Multidisciplinary

Slicing and Dicing: Optimal Coarse-Grained Representation to Preserve Molecular Kinetics

Wangfei Yang, Clark Templeton, David Rosenberger, Andreas Bittracher, Feliks Nueske, Frank Noe, Cecilia Clementi

Summary: The aim of molecular coarse-graining approaches is to simulate the physical properties of a molecular system more efficiently using a lower-resolution model. In this article, we argue that accurate coarse-grained models in soft matter contexts should accurately capture rare-event transitions to reproduce the system's long-time dynamics. We propose a bottom-up coarse-graining scheme that preserves the relevant slow degrees of freedom, and demonstrate its effectiveness in three systems of increasing complexity.

ACS CENTRAL SCIENCE (2023)

Article Chemistry, Physical

Efficient approximation of molecular kinetics using random Fourier features

Feliks Nueske, Stefan Klus

Summary: The use of random Fourier features as a stochastic approximation method allows for more efficient estimation of slow kinetic processes.

JOURNAL OF CHEMICAL PHYSICS (2023)

Article Chemistry, Physical

DeepQMC: An open-source software suite for variational optimization of deep-learning molecular wave functions

Z. Schaetzle, P. B. Szabo, M. Mezera, J. Hermann, F. Noe

Summary: Computing accurate and efficient approximations to solve the Schrödinger equation in computational chemistry has been a challenge for decades. Quantum Monte Carlo methods, with their highly parallel and scalable algorithm, show promise in achieving high accuracy in a variety of molecular systems. The use of machine-learned parametrizations, relying on neural networks as universal function approximators, has further improved the accuracy of these methods. The development of software libraries like DEEPQMC aims to provide a common framework for future investigations and make this field accessible to practitioners from both the quantum chemistry and machine learning communities.

JOURNAL OF CHEMICAL PHYSICS (2023)

Review Chemistry, Multidisciplinary

Ab initio quantum chemistry with neural-network wavefunctions

Jan Hermann, James Spencer, Kenny Choo, Antonio Mezzacapo, W. M. C. Foulkes, David Pfau, Giuseppe Carleo, Frank Noe

Summary: Deep learning methods have surpassed human capabilities in pattern recognition and data processing, and have become increasingly important in scientific discovery. In molecular science, a key application of machine learning is to learn potential energy surfaces or force fields from ab initio solutions of the electronic Schrodinger equation obtained with various quantum chemistry methods. This review discusses a complementary approach that uses machine learning to directly solve quantum chemistry problems from first principles, focusing on quantum Monte Carlo methods with neural-network ansatzes to solve the electronic Schrodinger equation.

NATURE REVIEWS CHEMISTRY (2023)

Article Physics, Multidisciplinary

Pareto-optimal cycles for power, efficiency and fluctuations of quantum heat engines using reinforcement learning

Paolo A. Erdman, Alberto Rolandi, Paolo Abiuso, Marti Perarnau-Llobet, Frank Noe

Summary: The full optimization of a quantum heat engine requires trade-offs between power, efficiency, and fluctuations. A general framework is proposed to identify Pareto-optimal cycles that balance these objectives. Reinforcement learning is used to find the Pareto front of a quantum dot-based engine, revealing abrupt changes in optimal cycles when switching between optimizing two and three objectives. Analytical results accurately describe different regions of the Pareto front in fast- and slow-driving regimes.

PHYSICAL REVIEW RESEARCH (2023)

Article Physics, Multidisciplinary

Temperature steerable flows and Boltzmann generators

Manuel Dibak, Leon Klein, Andreas Kraemer, Frank Noe

Summary: Boltzmann generators solve the sampling problem in many-body physics by combining a normalizing flow and a statistical reweighting method. Temperature steerable flows (TSFs) are proposed to generate a family of probability densities parametrized by a choosable temperature parameter, allowing for sampling of a physical system across multiple thermodynamic states.

PHYSICAL REVIEW RESEARCH (2022)

Article Computer Science, Artificial Intelligence

Generating stable molecules using imitation and reinforcement learning

Soren Ager Meldgaard, Jonas Koehler, Henrik Lund Mortensen, Mads-Peter Christiansen, Frank Noe, Bjork Hammer

Summary: This study proposes a reinforcement learning approach for generating molecules in chemical space and predicting their stability using quantum chemistry. By combining imitation learning and reinforcement learning, the sample efficiency is improved, and low energy molecules are generated under different stoichiometries conditions.

MACHINE LEARNING-SCIENCE AND TECHNOLOGY (2022)

暂无数据