Article

Knowledge-Embedded Message-Passing Neural Networks: Improving Molecular Property Prediction with Human Knowledge

Journal

ACS OMEGA
Volume 6, Issue 42, Pages 27955-27967

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acsomega.1c03839

Funding

  1. Showa Denko Materials Co., Ltd.

The study extends the message-passing neural network (MPNN) into a novel architecture, the knowledge-embedded MPNN (KEMPNN), which can be supervised with nonquantitative knowledge annotations from human experts that mark important substructures of a molecule on its chemical graph. Evaluated in a small-training-data setting on physical chemistry datasets and a polymer property dataset, the KEMPNN with knowledge supervision improved on the prediction accuracy of the MPNN and was better than or comparable to descriptor-based methods.
Graph neural networks (GNNs) have become a promising method for predicting molecular properties with end-to-end supervision, because they learn molecular features directly from chemical graphs in a black-box manner. However, achieving high prediction accuracy requires supervision with a large amount of property data, which is often costly to obtain experimentally. Before deep learning, descriptor-based quantitative structure-property relationship (QSPR) studies drew on physical and chemical knowledge to manually design descriptors that predict properties effectively. In this study, we extend the message-passing neural network (MPNN) into a novel architecture, the knowledge-embedded MPNN (KEMPNN), which can be supervised together with nonquantitative knowledge annotations by human experts on a chemical graph; these annotations indicate the important substructures of a molecule and their effect on the target property (e.g., positive or negative). We evaluated the KEMPNN in a small-training-data setting using physical chemistry datasets in MoleculeNet (ESOL, FreeSolv, Lipophilicity) and a polymer property (glass-transition temperature) dataset with virtual knowledge annotations. The results demonstrate that the KEMPNN with knowledge supervision improves on the prediction accuracy of the MPNN, and that its accuracy is better than or comparable to that of descriptor-based methods even with small training data.
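
As a rough illustration of the idea in the abstract, the following Python sketch combines a property loss with a knowledge term computed on per-atom scores. It is not the KEMPNN architecture from the paper: the scalar atom features, the sum aggregation, the tanh scoring, the signed annotation values, the weight `lam`, and all function names are assumptions made for this example.

```python
import math


def message_pass(features, adjacency, steps=2):
    """Toy message passing: each atom adds the sum of its neighbours' states."""
    h = list(features)
    for _ in range(steps):
        h = [h[i] + sum(h[j] for j in adjacency[i]) for i in range(len(h))]
    return h


def atom_attention(h, weight, bias):
    """Signed per-atom score from tanh of a linear map (assumed form)."""
    return [math.tanh(weight * x + bias) for x in h]


def readout(h, attention):
    """Attention-weighted sum of atom states as the predicted property."""
    return sum(a * x for a, x in zip(attention, h))


def joint_loss(pred, target, attention, annotation, lam=0.5):
    """Squared property error plus a knowledge term on annotated atoms.

    annotation holds +1.0 (positive effect), -1.0 (negative effect),
    or None for atoms without an expert label (hypothetical encoding).
    """
    prop = (pred - target) ** 2
    labelled = [(a, k) for a, k in zip(attention, annotation) if k is not None]
    if not labelled:
        return prop
    know = sum((a - k) ** 2 for a, k in labelled) / len(labelled)
    return prop + lam * know


# Hypothetical three-atom chain (C-C-O) with made-up scalar features.
features = [1.0, 1.0, 2.0]
adjacency = [[1], [0, 2], [1]]
annotation = [None, None, 1.0]  # assumed expert label on the last atom

h = message_pass(features, adjacency, steps=2)
attention = atom_attention(h, weight=0.1, bias=0.0)
pred = readout(h, attention)
print(joint_loss(pred, target=1.0, attention=attention, annotation=annotation))
```

In this sketch the knowledge term only touches atoms that carry a label, so molecules without annotations still contribute through the property term alone.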
