4.3 Article

An accurate feature-based method for identifying DNA-binding residues on protein surfaces

期刊

出版社

WILEY-BLACKWELL
DOI: 10.1002/prot.22898

关键词

protein-DNA interaction; structural feature; B-factor; packing density; support vector machine

资金

  1. National Natural Science Foundation of China [60773010, 60970063, 20773085, 30870476]
  2. Ph.D. Programs Foundation of Ministry of Education of China [20090141110026]
  3. National 863 Bioinformatics Projects [2007AA02Z333]

向作者/读者索取更多资源

Proteins that interact with DNA play vital roles in all mechanisms of gene expression and regulation. In order to understand these activities, it is crucial to analyze and identify DNA-binding residues on DNA-binding protein surfaces. Here, we proposed two novel features B-factor and packing density in combination with several conventional features to characterize the DNA-binding residues in a well-constructed representative dataset of 119 protein-DNA complexes from the Protein Data Bank (PDB). Based on the selected features, a prediction model for DNA-binding residues was constructed using support vector machine (SVM). The predictor was evaluated using a 5-fold cross validation on above dataset of 123 DNA-binding proteins. Moreover, two independent datasets of 83 DNA-bound protein structures and their corresponding DNA-free forms were compiled. The B-factor and packing density features were statistically analyzed on these 83 pairs of holo-apo proteins structures. Finally, we developed the SVM model to accurately predict DNA-binding residues on protein surface, given the DNA-free structure of a protein. Results showed here indicate that our method represents a significant improvement of previously existing approaches such as DISPLAR. The observation suggests that our method will be useful in studying protein-DNA interactions to guide consequent works such as site-directed mutagenesis and protein-DNA docking. Proteins 2011; 79:509-517. (C) 2010 Wiley-Liss, Inc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据