4.7 Article

Fast Prediction of Lipophilicity of Organofluorine Molecules: Deep Learning-Derived Polarity Characters and Experimental Tests

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 62, 期 20, 页码 4928-4936

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.2c01201

关键词

-

资金

  1. National Key Research and Development Program of China
  2. National Natural Science Foundation of China
  3. High Performance Computing Centre of Nanjing University
  4. [2019YFC0408303]
  5. [22033004]
  6. [21873045]

向作者/读者索取更多资源

Fast and accurate estimation of the lipophilicity of organofluorine molecules is in high demand. An efficient model called PoLogP is developed to predict the lipophilicity of these molecules based on the combination of polarity descriptors and hydrogen bond index. The model utilizes a multilevel attention graph convolutional neural network to generate polarity descriptors quickly. Experimental results demonstrate that PoLogP outperforms the dipole moment method in predicting the lipophilicity of organofluorine molecules.
Fast and accurate estimation of lipophilicity for organofluorine molecules is in great demand for accelerating drug and materials discovery. A lipophilicity data set of organofluorine molecules (OFL data set), containing 1907 samples, is constructed through density functional theory (DFT) calculations and experimental measurements. An efficient and interpretable model, called PoLogP, is developed to predict the n-octanol/water partition coefficient, log P-o/w, of organofluorine molecules on the basis of the descriptors of polarization, which is a combination of polarity descriptors, including the molecular polarity index and molecular polarizability (alpha), and hydrogen bond (HBs) index, consisting of the number of donors (N-HB(D)) and acceptors (N-HB(A) and N-HB-F(A)). The present PoLogP with a combination of polarity descriptors is demonstrated to perform better than the dipole moment (mu) alone for the F-contained molecules. With the aid of a multilevel attention graph convolutional neural network model, the fast generation of polarity descriptors of organofluorine molecules could be achieved with the DFT accuracy based only on a topological molecular graph structure. The performance of PoLogP is further validated on synthesized organofluorine molecules and 2626 non-fluorinated molecules with satisfactory accuracy, highlighting the potential usage of PoLogP in high-throughput screening of the functional molecules with the desired solubility in various solvent media.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据