4.7 Article

Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline

期刊

MOLECULAR & CELLULAR PROTEOMICS
卷 14, 期 2, 页码 399-404

出版社

AMER SOC BIOCHEMISTRY MOLECULAR BIOLOGY INC
DOI: 10.1074/mcp.O114.043380

关键词

-

资金

  1. National Institute of General Medical Sciences [R01 GM087221, 2P50 GM076547, S10RR027584]
  2. National Science Foundation MRI [0923536]
  3. American Recovery and Reinvestment Act (ARRA) funds through National Institutes of Health
  4. National Human Genome Research Institute [RC2 HG005805]
  5. National Institute of Biomedical Imaging and Bioengineering [U54EB020406]
  6. Amazon Web Services Inc.

向作者/读者索取更多资源

Cloud computing, where scalable, on-demand compute cycles and storage are available as a service, has the potential to accelerate mass spectrometry-based proteomics research by providing simple, expandable, and affordable large-scale computing to all laboratories regardless of location or information technology expertise. We present new cloud computing functionality for the Trans-Proteomic Pipeline, a free and open-source suite of tools for the processing and analysis of tandem mass spectrometry datasets. Enabled with Amazon Web Services cloud computing, the Trans-Proteomic Pipeline now accesses large scale computing resources, limited only by the available Amazon Web Services infrastructure, for all users. The Trans-Proteomic Pipeline runs in an environment fully hosted on Amazon Web Services, where all software and data reside on cloud resources to tackle large search studies. In addition, it can also be run on a local computer with computationally intensive tasks launched onto the Amazon Elastic Compute Cloud service to greatly decrease analysis times. We describe the new Trans-Proteomic Pipeline cloud service components, compare the relative performance and costs of various Elastic Compute Cloud service instance types, and present on-line tutorials that enable users to learn how to deploy cloud computing technology rapidly with the Trans-Proteomic Pipeline. We provide tools for estimating the necessary computing resources and costs given the scale of a job and demonstrate the use of cloud enabled Trans-Proteomic Pipeline by performing over 1100 tandem mass spectrometry files through four proteomic search engines in 9 h and at a very low cost.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据