4.7 Article

Comparing a Query Compound with Drug Target Classes Using 3D-Chemical Similarity

Journal

Publisher

MDPI
DOI: 10.3390/ijms21124208

Keywords

Kullback-Leibler (K-L) divergence; chemocentric similarity; Jaccard-Tanimoto coefficient; Gaussian mixture model (GMM); expectation-maximization (EM) algorithm; maximum likelihood (ML) estimation; machine learning

Funding

  1. Basic Science Research Program of the National Research Foundation of Korea (NRF) - Ministry of Education, Science, and Technology [2017R1E1A1A01076642]
  2. Agency for Defense Development [PD1806130GD]

Ask authors/readers for more resources

3D similarity is useful in predicting the profiles of unprecedented molecular frameworks that are 2D dissimilar to known compounds. When comparing pairs of compounds, 3D similarity of the pairs depends on conformational sampling, the alignment method, the chosen descriptors, and the similarity coefficients. In addition to these four factors, 3D chemocentric target prediction of an unknown compound requires compound-target associations, which replace compound-to-compound comparisons with compound-to-target comparisons. In this study, quantitative comparison of query compounds to target classes (one-to-group) was achieved via two types of 3D similarity distributions for the respective target class with parameter optimization for the fitting models: (1) maximum likelihood (ML) estimation of queries, and (2) the Gaussian mixture model (GMM) of target classes. While Jaccard-Tanimoto similarity of query-to-ligand pairs with 3D structures (sampled multi-conformers) can be transformed into query distribution using ML estimation, the ligand pair similarity within each target class can be transformed into a representative distribution of a target class through GMM, which is hyperparameterized via the expectation-maximization (EM) algorithm. To quantify the discriminativeness of a query ligand against target classes, the Kullback-Leibler (K-L) divergence of each query was calculated and compared between targets. 3D similarity-based K-L divergence together with the probability and the feasibility index, (F-m), showed discriminative power with regard to some query-class associations. The K-L divergence of 3D similarity distributions can be an additional method for (1) the rank of the 3D similarity score or (2) thep-value of one 3D similarity distribution to predict the target of unprecedented drug scaffolds.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available