
A multi-label approach to target prediction taking ligand promiscuity into account

JOURNAL OF CHEMINFORMATICS (2015)

Journal

JOURNAL OF CHEMINFORMATICS

Volume 7, Issue -, Pages -

Publisher

BMC

DOI: 10.1186/s13321-015-0071-9

Keywords

Multi-label classification; Ligand promiscuity; Probabilistic classifier

Categories

Chemistry, Multidisciplinary; Computer Science, Information Systems; Computer Science, Interdisciplinary Applications

Funding

Centre for Molecular Informatics
Unilever

Abstract

Background: According to Cobanoglu et al., it is now widely acknowledged that the single target paradigm (one protein/target, one disease, one drug), which has been the dominant premise in drug development in the recent past, is untenable. More often than not, a drug-like compound (ligand) can be promiscuous: it can interact with more than one target protein. In recent years, in silico target prediction methods have generally approached the promiscuity issue in three main ways: ligand-based methods, target-protein-based methods, and integrative schemes. In this study we confine attention to ligand-based machine learning approaches to target prediction, commonly referred to as target-fishing. The target-fishing approaches that are currently ubiquitous in the cheminformatics literature can essentially be viewed as single-label multi-class classification schemes; they inherently rely on the single target paradigm assumption that a ligand homes in on one single target. To address the ligand promiscuity issue, one might instead cast target-fishing as a multi-label multi-class classification problem. For illustrative and comparison purposes, single-label and multi-label Naive Bayes classification models (denoted here by SMM and MMM, respectively) for target-fishing were implemented. The models were constructed and tested on 65,587 compounds/ligands and 308 targets retrieved from the ChEMBL17 database.

Results: On classifying 3,332 test multi-label (promiscuous) compounds, SMM and MMM performed differently. At the 0.05 significance level, a Wilcoxon signed rank test performed on the paired target predictions yielded by SMM and MMM for the test ligands gave a p-value < 5.1 × 10^-94 and a test statistic of 6.8 × 10^5, in favour of MMM. When tested on four datasets comprising single-label (non-promiscuous) compounds, the two models performed differently on three of them: McNemar's test yielded χ² values of 15.657, 16.500 and 16.405 (with corresponding p-values of 7.594 × 10^-5, 4.865 × 10^-5 and 5.115 × 10^-5, respectively), in favour of MMM. The models performed similarly on the fourth set.

Conclusions: The target prediction results obtained in this study indicate that multi-label multi-class approaches are more apt than the ubiquitous single-label multi-class schemes for applying ligand-based classifiers to target-fishing.
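The McNemar figures quoted in the abstract can be sanity-checked with a few lines of Python. The sketch below is not the authors' code. It assumes the reported p-values are upper-tail probabilities of a chi-square distribution with one degree of freedom, which is the usual reference distribution for McNemar's test. The helper names `mcnemar_chi2` and `chi2_sf_1dof` are hypothetical, and the continuity correction in `mcnemar_chi2` is an assumption, since the abstract does not say whether one was applied.

```python
import math


def mcnemar_chi2(b: int, c: int, continuity: bool = True) -> float:
    """McNemar's chi-square statistic from the two discordant counts.

    b: cases classifier A got right and classifier B got wrong
    c: cases classifier A got wrong and classifier B got right
    continuity: apply the common (|b - c| - 1)^2 correction (an assumption;
                the abstract does not state which variant was used)
    """
    if b + c == 0:
        raise ValueError("at least one discordant pair is required")
    diff = abs(b - c)
    if continuity:
        diff = max(diff - 1, 0)
    return diff ** 2 / (b + c)


def chi2_sf_1dof(x: float) -> float:
    """Upper-tail probability of a chi-square variable with 1 degree of freedom.

    With one degree of freedom the variable is the square of a standard
    normal, so P(X > x) = erfc(sqrt(x / 2)).
    """
    if x < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.erfc(math.sqrt(x / 2.0))


# (chi-square, p-value) pairs as reported in the abstract.
reported = [(15.657, 7.594e-05), (16.500, 4.865e-05), (16.405, 5.115e-05)]

for chi2, p_reported in reported:
    p_computed = chi2_sf_1dof(chi2)
    print(f"chi2={chi2:.3f}  reported p={p_reported:.3e}  computed p={p_computed:.3e}")
```

Under that assumption, the computed tail probabilities should agree with the reported p-values to within rounding. The discordant counts themselves are not given in the abstract, so `mcnemar_chi2` is included only to show how such a statistic would be formed from a pair of classifiers' disagreements.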
