4.7 Article

Isoform-Level Interpretation of High-Throughput Proteomics Data Enabled by Deep Integration with RNA-seq

期刊

JOURNAL OF PROTEOME RESEARCH
卷 17, 期 10, 页码 3431-3444

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.8b00310

关键词

RNA-seq; ribosome profiling; mass spectrometry; peptides; isoforms; proteogenomics; HEK293; brain; expectation maximization; integrative analysis

资金

  1. NIDA [R21DA040454]
  2. NIMH [U01MH103339, MH106934, MH11092]
  3. Yale/NIDA Neuroproteomics Centre [DA018343]
  4. Center for Molecular and Epigenetic Research of Cell Types Mediating Addictive Behaviors at The Rockefeller University [5P30DA035756]
  5. NIH SIG grants [1S10OD019967-0, 1S10ODOD018034-01]
  6. State of Connecticut, Department of Mental Health & Addiction Services
  7. Brain & Behavior Research Foundation
  8. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [U24HG009446] Funding Source: NIH RePORTER
  9. NATIONAL INSTITUTE OF MENTAL HEALTH [U01MH103339, P50MH106934, R01MH110926] Funding Source: NIH RePORTER
  10. NATIONAL INSTITUTE ON DRUG ABUSE [P30DA035756, R21DA040454, P30DA018343] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Cellular control of gene expression is a complex process that is subject to multiple levels of regulation, but ultimately it is the protein produced that determines the biosynthetic state of the cell. One way that a cell can regulate the protein output from each gene is by expressing alternate isoforms with distinct amino acid sequences. These isoforms may exhibit differences in localization and binding interactions that can have profound functional implications. High throughput liquid chromatography tandem mass spectrometry proteomics (LC-MS/MS) relies on enzymatic digestion and has lower coverage and sensitivity than transcriptomic profiling methods such as RNA-seq. Digestion results in predictable fragmentation of a protein, which can limit the generation of peptides capable of distinguishing between isoforms. Here we exploit transcript-level expression from RNA-seq to set prior likelihoods and enable protein isoform abundances to be directly estimated from LC-MS/MS, an approach derived from the principle that most genes appear to be expressed as a single dominant isoform in a given cell type or tissue. Through this deep integration of RNA-seq and LC-MS/MS data from the same sample, we show that a principal isoform can be identified in >80% of gene products in homogeneous HEK293 cell culture and >70% of proteins detected in complex human brain tissue. We demonstrate that the incorporation of translatome data from ribosome profiling further refines this process. Defining isoforms in experiments with matched RNA-seq/translatome and proteomic data increases the functional relevance of such data sets and will further broaden our understanding of multilevel control of gene expression.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据