4.7 Article

Predictive models of aqueous solubility of organic compounds built on A large dataset of high integrity

Journal

BIOORGANIC & MEDICINAL CHEMISTRY
Volume 27, Issue 14, Pages 3110-3114

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.bmc.2019.05.037

Keywords

Kinetic solubility; Atom typing descriptors; Support vector classification (SVC); In silico ADME model; Prediction

Funding

  1. Intramural Research Program of the National Center for Advancing Translational Sciences, National Institutes of Health
  2. NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES [ZIATR000369] Funding Source: NIH RePORTER

Ask authors/readers for more resources

Aqueous solubility is one of the most important properties in drug discovery, as it has profound impact on various drug properties, including biological activity, pharmacokinetics (PK), toxicity, and in vivo efficacy. Both kinetic and thermodynamic solubilities are determined during different stages of drug discovery and development. Since kinetic solubility is more relevant in preclinical drug discovery research, especially during the structure optimization process, we have developed predictive models for kinetic solubility with in-house data generated from 11,780 compounds collected from over 200 NCATS intramural research projects. This represents one of the largest kinetic solubility datasets of high quality and integrity. Based on the customized atom type descriptors, the support vector classification (SVC) models were trained on 80% of the whole dataset, and exhibited high predictive performance for estimating the solubility of the remaining 20% compounds within the test set. The values of the area under the receiver operating characteristic curve (AUC-ROC) for the compounds in the test sets reached 0.93 and 0.91, when the threshold for insoluble compounds was set to 10 and 50 mu g/mL respectively. The predictive models of aqueous solubility can be used to identify insoluble compounds in drug discovery pipeline, provide design ideas for improving solubility by analyzing the atom types associated with poor solubility and prioritize compound libraries to be purchased or synthesized.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available