☆ 4.7 Article

Cloud Parallel Processing of Tandem Mass Spectrometry Based Proteomics Data

JOURNAL OF PROTEOME RESEARCH (2012)

期刊

JOURNAL OF PROTEOME RESEARCH

卷 11, 期 10, 页码 5101-5108

出版社

AMER CHEMICAL SOC

DOI: 10.1021/pr300561q

关键词

proteomics; mass spectrometry; scientific workflow; data decomposition

类别

Biochemical Research Methods

资金

Dutch Organization for Scientific Research (De Nederlandse Organisatie voor Wetenschappelijk Onderzoek, NWO) [NRG-2010.06, BG-043-11, VI-917.11.398]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.

Cloud Parallel Processing of Tandem Mass Spectrometry Based Proteomics Data

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Cloud Parallel Processing of Tandem Mass Spectrometry Based Proteomics Data

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文