期刊
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN
卷 30, 期 11, 页码 1019-1033出版社
SPRINGER
DOI: 10.1007/s10822-016-9928-x
关键词
SAMPL5; Distribution coefficient; Solvation free energy; Extended solvent-contact model; Genetic algorithm
资金
- Creative Materials Discovery Program through the National Research Foundation of Korea - Ministry of Science, ICT and Future Planning [2015M3D1A1069705]
- Basic Science Research Program through the National Research Foundation of Korea - Ministry of Education [NRF-2016R1D1A1B01014187]
The performance of the extended solvent-contact model has been addressed in the SAMPL5 blind prediction challenge for distribution coefficient (LogD) of drug-like molecules with respect to the cyclohexane/water partitioning system. All the atomic parameters defined for 41 atom types in the solvation free energy function were optimized by operating a standard genetic algorithm with respect to water and cyclohexane solvents. In the parameterizations for cyclohexane, the experimental solvation free energy (Delta G (sol) ) data of 15 molecules for 1-octanol were combined with those of 77 molecules for cyclohexane to construct a training set because Delta G (sol) values of the former were unavailable for cyclohexane in publicly accessible databases. Using this hybrid training set, we established the LogD prediction model with the correlation coefficient (R), average error (AE), and root mean square error (RMSE) of 0.55, 1.53, and 3.03, respectively, for the comparison of experimental and computational results for 53 SAMPL5 molecules. The modest accuracy in LogD prediction could be attributed to the incomplete optimization of atomic solvation parameters for cyclohexane. With respect to 31 SAMPL5 molecules containing the atom types for which experimental reference data for Delta G (sol) were available for both water and cyclohexane, the accuracy in LogD prediction increased remarkably with the R, AE, and RMSE values of 0.82, 0.89, and 1.60, respectively. This significant enhancement in performance stemmed from the better optimization of atomic solvation parameters by limiting the element of training set to the molecules with experimental Delta G (sol) data for cyclohexane. Due to the simplicity in model building and to low computational cost for parameterizations, the extended solvent-contact model is anticipated to serve as a valuable computational tool for LogD prediction upon the enrichment of experimental Delta G (sol) data for organic solvents.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据