4.7 Article

AlphaFold2-aware protein-DNA binding site prediction using graph transformer

期刊

BRIEFINGS IN BIOINFORMATICS
卷 23, 期 2, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab564

关键词

protein-DNA binding site; AlphaFold2; predicted protein structure; graph transformer

资金

  1. National Key R&D Program of China [2020YFB0204803]
  2. National Natural Science Foundation of China [61772566, 62041209]
  3. Guangdong Key Field RD Plan [2019B020228001, 2018B010109006, 2016ZT06D211]
  4. Guangzhou ST Research Plan [202007030010]

向作者/读者索取更多资源

In this study, a precise predictor called GraphSite based on AlphaFold2 is proposed for identifying DNA-binding residues from protein structural models. By employing a graph transformer and leveraging predicted protein structures, GraphSite significantly improves the accuracy of DNA binding site prediction.
Protein-DNA interactions play crucial roles in the biological systems, and identifying protein-DNA binding sites is the first step for mechanistic understanding of various biological activities (such as transcription and repair) and designing novel drugs. How to accurately identify DNA-binding residues from only protein sequence remains a challenging task. Currently, most existing sequence-based methods only consider contextual features of the sequential neighbors, which are limited to capture spatial information. Based on the recent breakthrough in protein structure prediction by AlphaFold2, we propose an accurate predictor, GraphSite, for identifying DNA-binding residues based on the structural models predicted by AlphaFold2. Here, we convert the binding site prediction problem into a graph node classification task and employ a transformer-based variant model to take the protein structural information into account. By leveraging predicted protein structures and graph transformer, GraphSite substantially improves over the latest sequence-based and structure-based methods. The algorithm is further confirmed on the independent test set of 181 proteins, where GraphSite surpasses the state-of-the-art structure-based method by 16.4% in area under the precision-recall curve and 11.2% in Matthews correlation coefficient, respectively. We provide the datasets, the predicted structures and the source codes along with the pre-trained models of GraphSite at http://github.com/biomed-AI/GraphSite.. The GraphSite web server is freely available at .

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据