4.4 Article

Extended solvent-contact model approach to blind SAMPL5 prediction challenge for the distribution coefficients of drug-like molecules

期刊

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN
卷 30, 期 11, 页码 1019-1033

出版社

SPRINGER
DOI: 10.1007/s10822-016-9928-x

关键词

SAMPL5; Distribution coefficient; Solvation free energy; Extended solvent-contact model; Genetic algorithm

资金

  1. Creative Materials Discovery Program through the National Research Foundation of Korea - Ministry of Science, ICT and Future Planning [2015M3D1A1069705]
  2. Basic Science Research Program through the National Research Foundation of Korea - Ministry of Education [NRF-2016R1D1A1B01014187]

向作者/读者索取更多资源

The performance of the extended solvent-contact model has been addressed in the SAMPL5 blind prediction challenge for distribution coefficient (LogD) of drug-like molecules with respect to the cyclohexane/water partitioning system. All the atomic parameters defined for 41 atom types in the solvation free energy function were optimized by operating a standard genetic algorithm with respect to water and cyclohexane solvents. In the parameterizations for cyclohexane, the experimental solvation free energy (Delta G (sol) ) data of 15 molecules for 1-octanol were combined with those of 77 molecules for cyclohexane to construct a training set because Delta G (sol) values of the former were unavailable for cyclohexane in publicly accessible databases. Using this hybrid training set, we established the LogD prediction model with the correlation coefficient (R), average error (AE), and root mean square error (RMSE) of 0.55, 1.53, and 3.03, respectively, for the comparison of experimental and computational results for 53 SAMPL5 molecules. The modest accuracy in LogD prediction could be attributed to the incomplete optimization of atomic solvation parameters for cyclohexane. With respect to 31 SAMPL5 molecules containing the atom types for which experimental reference data for Delta G (sol) were available for both water and cyclohexane, the accuracy in LogD prediction increased remarkably with the R, AE, and RMSE values of 0.82, 0.89, and 1.60, respectively. This significant enhancement in performance stemmed from the better optimization of atomic solvation parameters by limiting the element of training set to the molecules with experimental Delta G (sol) data for cyclohexane. Due to the simplicity in model building and to low computational cost for parameterizations, the extended solvent-contact model is anticipated to serve as a valuable computational tool for LogD prediction upon the enrichment of experimental Delta G (sol) data for organic solvents.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据