Article

Dynamics of domain coverage of the protein sequence universe

Journal

BMC GENOMICS
Volume 13

Publisher

BMC
DOI: 10.1186/1471-2164-13-634

Funding

  1. Laboratory Directed Research and Development program at the Oak Ridge National Laboratory managed by UT-Battelle, LLC [DE-AC05-00OR22725]
  2. EPSCoR
  3. Office Of The Director [0919436] Funding Source: National Science Foundation

Background: The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and of the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its dark matter.

Results: Here we suggest that the true size of the dark matter is much larger than current definitions state. We propose an approach to reducing the size of the dark matter by identifying and subtracting regions in protein sequences that are unlikely to contain any domain.

Conclusions: Recent improvements in computational domain modeling are reducing the relative size of the dark matter, albeit slowly; however, its absolute size increases substantially with the growth of sequence data.
