4.2 Article

Computationally Probing Drug-Protein Interactions Via Support Vector Machine

期刊

LETTERS IN DRUG DESIGN & DISCOVERY
卷 7, 期 5, 页码 370-378

出版社

BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/157018010791163433

关键词

Drug-target interaction; Chemical structure; Protein sequence; Imbalance problem; Support vector machine

资金

  1. Key Project of the National Natural Science Foundation of China [10631070, 10801131, 10801112, 10971223]
  2. Ph.D Graduate Start Research Foundation of Xinjiang University [BS080101]

向作者/读者索取更多资源

The past decades witnessed extensive efforts to study the relationships among small molecules (drugs, metabolites, or ligands) and proteins due to the scale and complexity of their physical and genetic interactions. Particularly, computationally predicting the drug-protein interactions is fundamentally important in speeding up the process of developing novel therapeutic agents. Here, we present a supervised learning method, support vector machine (SVM), to predict drug-protein interactions by introducing two machine learning ideas. Firstly, the chemical structure similarity among drugs and the genomic sequence similarity among proteins are intuitively encoded as a feature vector to represent a given drug-protein pair. Secondly, we design an automatic procedure to select a gold-standard negative dataset to deal with the training data imbalance issue, i.e., gold-standard positive data is scarce relative to large scale unlabeled data. Our SVM based predictor is validated on four classes of drug target proteins, including enzymes, ion channels, G-protein couple receptors, and nuclear receptors. We find that our method improves the existing methods regarding to true positive rate upon given false positive rate. The functional annotation analysis and database search indicate that our new predictions are worthy of future experimental validation. In addition, follow-up analysis suggests that our method can partly capture the topological features in the drug-protein interaction network. In conclusion, our new method can efficiently identify the potential drug-protein bindings and will promote the further research in drug discovery.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据