4.4 Article

The SAMPL6 challenge on predicting aqueous pKa values from EC-RISM theory

期刊

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN
卷 32, 期 10, 页码 1151-1163

出版社

SPRINGER
DOI: 10.1007/s10822-018-0140-z

关键词

SAMPL6; Solvation model; Quantum chemistry; Integral equation theory; EC-RISM; pK(a)

资金

  1. Cluster of Excellence RESOLV [EXC 1069]
  2. Deutsche Forschungsgemeinschaft [FOR 1979]

向作者/读者索取更多资源

The embedded cluster reference interaction site model (EC-RISM) integral equation theory is applied to the problem of predicting aqueous pK(a) values for drug-like molecules based on an ensemble of tautomers. EC-RISM is based on self-consistent calculations of a solute's electronic structure and the distribution function of surrounding water. Following-up on the workflow developed after the SAMPL5 challenge on cyclohexane-water distribution coefficients we extended and improved the methodology by taking into account exact electrostatic solute-solvent interactions taken from the wave function in solution. As before, the model is calibrated against Gibbs energies of hydration from the Minnesota Solvation Database and a public dataset of acidity constants of organic acids and bases by adjusting in total 4 parameters, among which only 3 are relevant for predicting pK(a) values. While the best-performing training model yields a root-mean-square error (RMSE) of 1pK unit, the corresponding test set prediction on the full SAMPL6 dataset of macroscopic pK(a) values using the same level of theory exhibits slightly larger error (1.7pK units) than the best test set model submitted (1.7pK units for corresponding training set vs. test set performance of 1.6). Post-submission analysis revealed a number of physical optimization options regarding the numerical treatment of electrostatic interactions and conformational sampling. While the experimental test set data revealed after submission was not used for reparametrizing the methodology, the best physically optimized models consequentially result in RMSEs of 1.5 if only improved electrostatic interactions are considered and of 1.1 if, in addition, conformational sampling accounts for quantum-chemically derived rankings. We conclude that these numbers are probably near the ultimate accuracy achievable with the simple 3-parameter model using a single or the two best-ranking conformations per tautomer or microstate. Finally, relations of the present macrostate approach to microstate pK(a) results are discussed and some illustrative results for microstate populations are presented.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据