Journal
TRENDS IN BIOTECHNOLOGY
Volume 29, Issue 4, Pages 174-182Publisher
ELSEVIER SCIENCE LONDON
DOI: 10.1016/j.tibtech.2011.01.001
Keywords
-
Categories
Funding
- U.S. Department of Energy [DE-FG02-07ER64498]
- National Institutes of Health [R01 GM70641-01]
Ask authors/readers for more resources
Nearly 2200 genomes that encode around 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function, even when function is loosely and minimally defined as 'belonging to a superfamily'. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these unknowns with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available