☆ 4.7 Article

Flexible Data Analysis Pipeline for High-Confidence Proteogenomics

JOURNAL OF PROTEOME RESEARCH (2016)

期刊

JOURNAL OF PROTEOME RESEARCH

卷 15, 期 12, 页码 4686-4695

出版社

AMER CHEMICAL SOC

DOI: 10.1021/acs.jproteome.6b00765

关键词

proteogenomics; bioinformatics; workflow; mass spectrometry; genome annotation; testis

类别

Biochemical Research Methods

资金

Wellcome Trust [WT098051]
National Institutes of Health [U41HG007234]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Proteogenomics leverages information derived from proteomic data to improve genome annotations. Of particular interest are novel peptides that provide direct evidence of protein expression for genomic regions not previously annotated as protein-coding. We present a modular, automated data analysis pipeline aimed at detecting such novel peptides in proteomic data sets. This pipeline implements criteria developed by proteomics and genome annotation experts for high-stringency peptide identification and filtering. Our pipeline is based on the OpenMS computational framework; it incorporates multiple database search engines for peptide identification and applies a machine-learning approach (Percolator) to post-process search results. We describe several new and improved software tools that we developed to facilitate proteogenomic analyses that enhance the wealth of tools provided by OpenMS. We demonstrate the application of our pipeline to a human testis tissue data set previously acquired for the Chromosome-Centric Human Proteome Project, which led to the addition of five new gene annotations on the human reference genome.

Flexible Data Analysis Pipeline for High-Confidence Proteogenomics

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Flexible Data Analysis Pipeline for High-Confidence Proteogenomics

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文