4.7 Article

Genome-scale prediction of moonlighting proteins using diverse protein association information

Journal

BIOINFORMATICS
Volume 32, Issue 15, Pages 2281-2288

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btw166

Keywords

-

Funding

  1. National Institute of General Medical Sciences of the National Institutes of Health [R01GM097528]
  2. National Science Foundation [IIS1319551, DBI1262189, IOS1127027]
  3. Direct For Biological Sciences
  4. Div Of Biological Infrastructure [1262189] Funding Source: National Science Foundation

Ask authors/readers for more resources

Motivation: Moonlighting proteins (MPs) showmultiple cellular functions within a single polypeptide chain. To understand the overall landscape of their functional diversity, it is important to establish a computational method that can identify MPs on a genome scale. Previously, we have systematically characterizedMPs using functional and omics-scale information. In this work, we develop a computational prediction model for automatic identification of MPs using a diverse range of protein association information. Results: We incorporated a diverse range of protein association information to extract characteristic features of MPs, which range from gene ontology (GO), protein-protein interactions, gene expression, phylogenetic profiles, genetic interactions and network-based graph properties to protein structural properties, i.e. intrinsically disordered regions in the protein chain. Then, we used machine learning classifiers using the broad feature space for predicting MPs. Because many known MPs lack some proteomic features, we developed an imputation technique to fill such missing features. Results on the control dataset show that MPs can be predicted with over 98% accuracy when GO terms are available. Furthermore, using only the omics-based features the method can still identify MPs with over 75% accuracy. Last, we applied the method on three genomes: Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens, and found that about 2-10% of proteins in the genomes are potential MPs. Availability and Implementation: Code available at http://kiharalab.org/MPprediction

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available