4.7 Article

Open Source Bayesian Models. 3. Composite Models for Prediction of Binned Responses

期刊

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.5b00555

关键词

-

资金

  1. Biocomputation across distributed private datasets to enhance drug discovery from NIH National Center for Advancing Translational Sciences [9R44TR000942-02]
  2. NIH National Institute of Allergy and Infectious Diseases Identification and validation of targets of phenotypic high throughput screening [R41-AI108003-01]

向作者/读者索取更多资源

Bayesian models constructed from structure derived fingerprints have been a popular and useful method for drug discovery research when applied to bioactivity measurements that can be effectively classified as active or inactive. The results can be used to rank candidate structures according to their probability of activity, and this ranking benefits from the high degree of interpretability when structure-based fingerprints are used, making the results chemically intuitive. Besides selecting an activity threshold, building a Bayesian model is fast and requires few or no parameters or user intervention. The method also does not suffer from such acute overtraining problems as quantitative structure activity relationships or quantitative structure property relationships (QSAR/QSPR). This makes it an approach highly suitable for automated workflows that are independent of user expertise or prior knowledge of the training data. We now describe a new method for creating a composite group of Bayesian models to extend the method to work with multiple states, rather than just binary. Incoming activities are divided into bins, each covering a mutually exclusive range of activities. For each of these bins, a Bayesian model is created to model whether or not the compound belongs in the bin. Analyzing putative molecules using the composite model involves making a prediction for each bin and examining the relative likelihood for each assignment, for example, highest value wins. The method has been evaluated on a collection of hundreds of data sets extracted from ChEMBL v20 and validated data sets for ADME/Tox and bioactivity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据