4.7 Article

Quantum-Based Molecular Dynamics Simulations Using Tensor Cores

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION
Volume 17, Issue 10, Pages 6180-6192

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jctc.1c00726

Keywords

-

Funding

  1. U.S. Department of Energy Office of Basic Energy Sciences
  2. LANL LDRD-ER program
  3. U.S. Department of Energy through the Los Alamos National Laboratory
  4. Computational Systems and Software Environments (CSSE) subprogram of LANL's ASC program (NNSA/DOE)

Ask authors/readers for more resources

Tensor cores, designed for deep neural network calculations, provide exceptional computational speed and energy efficiency for tensor contractions. Despite limitations, such as low-precision floating-point operations, they can still be efficiently applied to challenging quantum-based Born-Oppenheimer molecular dynamics, achieving high performance and stability.
Tensor cores, along with tensor processing units, represent a new form of hardware acceleration specifically designed for deep neural network calculations in artificial intelligence applications. Tensor cores provide extraordinary computational speed and energy efficiency but with the caveat that they were designed for tensor contractions (matrix-matrix multiplications) using only low-precision floating-point operations. Despite this perceived limitation, we demonstrate how tensor cores can be applied with high efficiency to the challenging and numerically sensitive problem of quantum-based Born-Oppenheimer molecular dynamics, which requires highly accurate electronic structure optimizations and conservative force evaluations. The interatomic forces are calculated on-the-fly from an electronic structure that is obtained from a generalized deep neural network, where the computational structure naturally takes advantage of the exceptional processing power of the tensor cores and allows for high performance in excess of 100 Tflops on a single Nvidia A100 GPU. Stable molecular dynamics trajectories are generated using the framework of extended Lagrangian Born-Oppenheimer molecular dynamics, which combines computational efficiency with long-term stability, even when using approximate charge relaxations and force evaluations that are limited in accuracy by the numerically noisy conditions caused by the low-precision tensor core floating-point operations. A canonical ensemble simulation scheme is also presented, where the additional numerical noise in the calculated forces is absorbed into a Langevin-like dynamics.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Chemistry, Physical

Linear Scaling Pseudo Fermi-Operator Expansion for Fractional Occupation

Susan M. Mniszewski, Romain Perriot, Emanuel H. Rubensson, Christian F. A. Negre, Marc J. Cawkwell, Anders M. N. Niklasson

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2019)

Article Chemistry, Physical

Mixed Precision Fermi-Operator Expansion on Tensor Cores from a Machine Learning Perspective

Joshua Finkelstein, Justin S. Smith, Susan M. Mniszewski, Kipton Barros, Christian F. A. Negre, Emanuel H. Rubensson, Anders M. N. Niklasson

Summary: A second-order recursive Fermi-operator expansion scheme using mixed precision floating point operations on tensor core units is presented, achieving over 100 teraFLOPS of performance for half-precision floating point operations on Nvidia's A100. The scheme is formulated as a generalized, differentiable deep neural network structure, accelerating convergence by optimizing weight and bias values, and optimizing coefficients of the expansion to accurately represent fractional occupation numbers of electronic states at finite temperatures using a machine learning approach.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Article Computer Science, Interdisciplinary Applications

Sparse approximate matrix-matrix multiplication for density matrix purification with error control

Anton G. Artemov, Emanuel H. Rubensson

Summary: The proposed method utilizes acceleration technique and submatrix product screening to reduce computational cost, with a selected threshold value for error control.

JOURNAL OF COMPUTATIONAL PHYSICS (2021)

Article Chemistry, Physical

Quantum Perturbation Theory Using Tensor Cores and a Deep Neural Network

Joshua Finkelstein, Emanuel H. Rubensson, Susan M. Mniszewski, Christian F. A. Negre, Anders M. N. Niklasson

Summary: In this paper, density matrix perturbation theory is mapped onto the computational structure of a deep neural network, and time-independent quantum response calculations are performed using Tensor cores. The main computational cost of each deep layer is dominated by tensor contractions in mixed-precision arithmetics, achieving close to peak performance. Quantum response calculations are demonstrated and analyzed with self-consistent charge density-functional tight-binding theory and coupled-perturbed Hartree-Fock theory. A novel parameter-free convergence criterion is presented for linear response calculations, suitable for numerically noisy low-precision floating point operations, and a peak performance of almost 200 Tflops is demonstrated using the Tensor cores of two Nvidia A100 GPUs.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2022)

Article Computer Science, Software Engineering

The Chunks and Tasks Matrix Library

Emanuel H. Rubensson, Elias Rudberg, Anastasia Kruchinina, Anton G. Artemov

Summary: We introduce a C++ header-only parallel sparse matrix library based on a sparse quadtree representation and the Chunks and Tasks programming model. This library dynamically exploits data locality to avoid data movement and demonstrates successful distributed memory parallelization of block-sparse matrix-matrix multiplication for matrices with different nonzero structures.

SOFTWAREX (2022)

No Data Available