4.6 Article

A Fragmentation-Based Graph Embedding Framework for QM/ML

期刊

JOURNAL OF PHYSICAL CHEMISTRY A
卷 125, 期 31, 页码 6872-6880

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jpca.1c06152

关键词

-

资金

  1. National Science Foundation at Indiana University [CHE-1665427, CHE-2102583]
  2. Lilly Endowment, Inc.

向作者/读者索取更多资源

This study introduces a new fragmentation-based molecular representation framework FragGraph for QM/ML methods, aimed at achieving high accuracy by correcting the deficiencies of approximate methods through ideas from fragmentation, error cancellation, and deep learning architecture. The framework combines the advantages of error cancellation methods into molecular machine learning by incorporating a general graph-network framework.
We introduce a new fragmentation-based molecular representation framework FragGraph for QM/ML methods involving embedding fragment-wise fingerprints onto molecular graphs. Our model is specifically designed for delta machine learning (Delta-ML) with the central goal of correcting the deficiencies of approximate methods such as DFT to achieve high accuracy. Our framework is based on a judicious combination of ideas from fragmentation, error cancellation, and a state-of-the-art deep learning architecture. Broadly, we develop a general graph-network framework for molecular machine learning by incorporating the inherent advantages prebuilt into error cancellation methods such as the generalized Connectivity-Based Hierarchy. More specifically, we develop a QM/ML representation through a fragmentationbased attributed graph representation encoded with fragment-wise molecular fingerprints. The utility of our representation is demonstrated through a graph network fingerprint encoder in which a global fingerprint is generated through message passing of local neighborhoods of fragment-wise fingerprints, effectively augmenting standard fingerprints to also include the inbuilt molecular graph structure. On the 130k-GDB9 dataset, our method predicts an out-of-sample mean absolute error significantly lower than 1 kJ/mol compared to target G4(MP2) calculated energies, rivaling current deep learning methods with reduced computational scaling.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据