4.3 Article

A machine learning approach for ranking clusters of docked protein-protein complexes by pairwise cluster comparison

期刊

出版社

WILEY
DOI: 10.1002/prot.25218

关键词

CAPRI; protein complex prediction; scoring; machine learning

资金

  1. Francis Crick Institute
  2. Cancer Research UK [FC001003]
  3. UK Medical Research Council [FC001003]
  4. Wellcome Trust [FC001003]
  5. BBSRC Future Leader Fellowship [BB/N011600/1]
  6. BBSRC [BB/N011600/1] Funding Source: UKRI
  7. Biotechnology and Biological Sciences Research Council [BB/N011600/1] Funding Source: researchfish
  8. Cancer Research UK [10748] Funding Source: researchfish

向作者/读者索取更多资源

Reliable identification of near-native poses of docked protein-protein complexes is still an unsolved problem. The intrinsic heterogeneity of protein-protein interactions is challenging for traditional biophysical or knowledge based potentials and the identification of many false positive binding sites is not unusual. Often, ranking protocols are based on initial clustering of docked poses followed by the application of an energy function to rank each cluster according to its lowest energy member. Here, we present an approach of cluster ranking based not only on one molecular descriptor (e.g., an energy function) but also employing a large number of descriptors that are integrated in a machine learning model, whereby, an extremely randomized tree classifier based on 109 molecular descriptors is trained. The protocol is based on first locally enriching clusters with additional poses, the clusters are then characterized using features describing the distribution of molecular descriptors within the cluster, which are combined into a pairwise cluster comparison model to discriminate near-native from incorrect clusters. The results show that our approach is able to identify clusters containing near-native protein-protein complexes. In addition, we present an analysis of the descriptors with respect to their power to discriminate near native from incorrect clusters and how data transformations and recursive feature elimination can improve the ranking performance. (C) 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据