4.7 Article

Flexible and Accessible Workflows for Improved Proteogenomic Analysis Using the Galaxy Framework

期刊

JOURNAL OF PROTEOME RESEARCH
卷 13, 期 12, 页码 5898-5908

出版社

AMER CHEMICAL SOC
DOI: 10.1021/pr500812t

关键词

proteogenomics; workflows; salivary proteins; customized database generation; peptide corresponding to a novel proteoform; peptide-spectral match evaluation

资金

  1. NSF [11476079]
  2. NIH [1P50HG004952]
  3. NIH Genomic Sciences Training Program [5T32HG002760]

向作者/读者索取更多资源

Proteogenomics combines large-scale genomic and transcriptomic data with mass-spectrometry-based proteomic data to discover novel protein sequence variants and improve genome annotation. In contrast with conventional proteomic applications, proteogenomic analysis requires a number of additional data processing steps. Ideally, these required steps would be integrated and automated via a single software platform offering accessibility for wet-bench researchers as well as flexibility for user-specific customization and integration of new software tools as they emerge. Toward this end, we have extended the Galaxy bioinformatics framework to facilitate proteogenomic analysis. Using analysis of whole human saliva as an example, we demonstrate Galaxys flexibility through the creation of a modular workflow incorporating both established and customized software tools that improve depth and quality of proteogenomic results. Our customized Galaxy-based software includes automated, batch-mode BLASTP searching and a Peptide Sequence Match Evaluator tool, both useful for evaluating the veracity of putative novel peptide identifications. Our complex workflow (approximately 140 steps) can be easily shared using built-in Galaxy functions, enabling their use and customization by others. Our results provide a blueprint for the establishment of the Galaxy framework as an ideal solution for the emerging field of proteogenomics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Review Biochemical Research Methods

An overview of technologies for MS-based proteomics-centric multi-omics

Andrew T. Rajczewski, Pratik D. Jagtap, Timothy J. Griffin

Summary: Proteomics, using mass spectrometry, is a valuable tool in biomedical and basic biological research to identify proteins in a system of interest. When integrated with other 'omics' data, such as transcriptomics, genomics, and metabolomics, it provides a more comprehensive understanding of biological systems and their responses to stimuli. This integration, known as multi-omics, requires complete and accurate proteomics data and can be challenging for inexperienced researchers.

EXPERT REVIEW OF PROTEOMICS (2022)

Article Biochemistry & Molecular Biology

The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update

Alexander E. Ostrovsky, Alexandru Mahmoud, Andrew J. Lonie, Anna Syme, Anne Fouilloux, Anthony Bretaudeau, Anton Nekrutenko, Anup Kumar, Arthur C. Eschenlauer, Assunta D. DeSanto, Aysam Guerler, Beatriz Serrano-Solano, Berenice Batut, Bjoern A. Gruening, Bradley W. Langhorst, Bridget Carr, Bryan A. Raubenolt, Cameron J. Hyde, Catherine J. Bromhead, Christopher B. Barnett, Coline Royaux, Cristobal Gallardo, Daniel Blankenberg, Daniel J. Fornika, Dannon Baker, Dave Bouvier, Dave Clements, David A. de Lima Morais, David Lopez Tabernero, Delphine Lariviere, Engy Nasr, Enis Afgan, Federico Zambelli, Florian Heyl, Fotis Psomopoulos, Frederik Coppens, Gareth R. Price, Gianmauro Cuccuru, Gildas Le Corguille, Greg Von Kuster, Gulsum Gudukbay Akbulut, Helena Rasche, Hotz Hans-Rudolf, Ignacio Eguinoa, Igor Makunin, Isuru J. Ranawaka, James P. Taylor, Jayadev Joshi, Jennifer Hillman-Jackson, Jeremy Goecks, John M. Chilton, Kaivan Kamali, Keith Suderman, Krzysztof Poterlowicz, Le Bras Yvan, Lucille Lopez-Delisle, Luke Sargent, Madeline E. Bassetti, Marco Antonio Tangaro, Marius van den Beek, Martin Cech, Matthias Bernt, Matthias Fahrner, Mehmet Tekman, Melanie C. Foell, Michael C. Schatz, Michael R. Crusoe, Miguel Roncoroni, Natalie Kucher, Nate Coraor, Nicholas Stoler, Nick Rhodes, Nicola Soranzo, Niko Pinter, Nuwan A. Goonasekera, Pablo A. Moreno, Pavankumar Videm, Petera Melanie, Pietro Mandreoli, Pratik D. Jagtap, Qiang Gu, Ralf J. M. Weber, Ross Lazarus, Ruben H. P. Vorderman, Saskia Hiltemann, Sergey Golitsynskiy, Shilpa Garg, Simon A. Bray, Simon L. Gladman, Simone Leo, Subina P. Mehta, Timothy J. Griffin, Vahid Jalili, Vandenbrouck Yves, Victor Wen, Vijay K. Nagampalli, Wendi A. Bacon, Willem de Koning, Wolfgang Maier, Peter J. Briggs

Summary: Galaxy is a mature and browser accessible workbench for scientific computing, allowing scientists to easily share, analyze, and visualize their data. It has a strong global community and support from national infrastructure providers. Key technical developments of Galaxy include improved user interface, interactive tools for data analysis, and a complete suite of machine learning tools. Important scientific developments enabled by Galaxy include Vertebrate Genome Project (VGP) assembly workflows and global SARS-CoV-2 collaborations.

NUCLEIC ACIDS RESEARCH (2022)

Article Microbiology

Gut microbial beta-glucuronidases regulate host luminal proteases and are depleted in irritable bowel syndrome

Adam L. Edwinson, Lu Yang, Stephanie Peters, Nikita Hanning, Patricio Jeraldo, Pratik Jagtap, Joshua B. Simpson, Tzu-Yi Yang, Praveen Kumar, Subina Mehta, Asha Nair, Margaret Breen-Lyles, Lakshmikanth Chikkamenahalli, Rondell P. Graham, Benedicte De Winter, Robin Patel, Surendra Dasari, Purna Kashyap, Timothy Griffin, Jun Chen, Gianrico Farrugia, Matthew R. Redinbo, Madhusudan Grover

Summary: Intestinal protease activity is suppressed by gut microbiota through the production of unconjugated bilirubin. In irritable bowel syndrome patients, an altered gut microbiota composition results in increased protease activity.

NATURE MICROBIOLOGY (2022)

Article Virology

Catching the Wave: Detecting Strain-Specific SARS-CoV-2 Peptides in Clinical Samples Collected during Infection Waves from Diverse Geographical Locations

Subina Mehta, Valdemir M. Carvalho, Andrew T. Rajczewski, Olivier Pible, Bjoern A. Gruening, James E. Johnson, Reid Wagner, Jean Armengaud, Timothy J. Griffin, Pratik D. Jagtap

Summary: The COVID-19 pandemic caused by the SARS-CoV-2 virus has resulted in a global health crisis, with the emergence of new strains posing challenges in detection. Mass spectrometry (MS)-based methods can help in diagnosing and developing vaccines by detecting and characterizing variant-specific peptide sequences from viral proteins. In this study, a bioinformatics workflow was developed to detect variant-specific peptide sequences from MS data derived from clinical samples. The workflow was shown to be effective in characterizing clinical data from different parts of the world, identifying six SARS-CoV-2 variant-specific peptides suitable for confident detection by MS in commonly collected clinical samples.

VIRUSES-BASEL (2022)

Article Respiratory System

Lung proteome and metabolome endotype in HIV-associated obstructive lung disease

Sarah Samorodnitsky, Eric F. Lock, Monica Kruk, Alison Morris, Janice M. Leung, Ken M. Kunisaki, Timothy J. Griffin, Chris H. Wendt

Summary: This study used aptamer proteomics to identify proteins and associated pathways in HIV-associated obstructive lung disease. The results showed that protein expression differs in persons living with HIV with and without obstructive lung disease. A unique protein endotype associated with insulin and apoptotic pathways was identified.

ERJ OPEN RESEARCH (2023)

Review Chemistry, Medicinal

Discovery of Modified Metabolites, Secondary Metabolites, and Xenobiotics by Structure-Oriented LC-MS/MS

Kevin J. Murray, Peter W. Villalta, Timothy J. Griffin, Silvia Balbo

Summary: The identification of covalently modified biomolecules, secondary metabolites, and xenobiotics is challenging in global metabolomics profiling. Liquid chromatography-coupled mass spectrometry (LC-MS) small molecule analytical workflows have been developed for the detection and characterization of these compounds. Continued advances in these methods expand the capacity for selective compound discovery and characterization.

CHEMICAL RESEARCH IN TOXICOLOGY (2023)

Review Biochemical Research Methods

A Galaxy of informatics resources for MS-based proteomics

Subina Mehta, Matthias Bernt, Matthew Chambers, Matthias Fahrner, Melanie Christine Foell, Bjoern Gruening, Carlos Horro, James E. Johnson, Valentin Loux, Andrew T. Rajczewski, Oliver Schilling, Yves Vandenbrouck, Ove Johan Ragnar Gustafsson, W. C. Mike Thang, Cameron Hyde, Gareth Price, Pratik D. Jagtap, Timothy J. Griffin

Summary: The Galaxy ecosystem offers a range of open-source tools for MS-based proteomics analyses, providing an adaptable, scalable, and accessible computing environment. This community-supported resource is crucial for basic biological and clinical studies, and ongoing developments are underway to meet emerging challenges in MS-based proteomic informatics.

EXPERT REVIEW OF PROTEOMICS (2023)

Article Biochemical Research Methods

Metaproteomic Analysis of Nasopharyngeal Swab Samples to Identify Microbial Peptides in COVID-19 Patients

Surbhi Bihani, Aryan Gupta, Subina Mehta, Andrew T. Rajczewski, James Johnson, Dhanush Borishetty, Timothy J. Griffin, Sanjeeva Srivastava, Pratik D. Jagtap

Summary: During the COVID-19 pandemic, the exploration of the microbiome is necessary due to impaired immunity and cases of secondary infections. This study used mass spectrometry-based data to investigate the metaproteome of nasopharyngeal swab samples from COVID-19 patients. Through a bioinformatics workflow, microbial peptides belonging to opportunistic pathogens were detected, and upregulated microbial proteins were found in severe patients. Clinical metaproteomics based on mass spectrometry can be a powerful tool for detecting and characterizing potential pathogens, impacting the diagnosis and treatment of patients.

JOURNAL OF PROTEOME RESEARCH (2023)

Letter Respiratory System

Compartment-specific protein interactions in beryllium lung disease

Li Li, Brian Vestal, Margaret M. Mroz, Sucai Liu, Kristyn MacPhail, Tim J. Griffin, Ivana V. Yang, Lisa A. Maier, Maneesh Bhargava

ERJ OPEN RESEARCH (2023)

Article Biochemistry & Molecular Biology

Quantitative Proteogenomic Characterization of Inflamed Murine Colon Tissue Using an Integrated Discovery, Verification, and Validation Proteogenomic Workflow

Andrew T. Rajczewski, Qiyuan Han, Subina Mehta, Praveen Kumar, Pratik D. Jagtap, Charles G. Knutson, James G. Fox, Natalia Y. Tretyakova, Timothy J. Griffin

Summary: Chronic inflammation of the colon can lead to the expression of non-canonical protein sequences, contributing to oncogenesis. This study used a mouse model to induce chronic inflammation and generated a customized database of non-canonical protein sequences. Through proteogenomic analysis, several non-canonical peptide sequences were identified and validated. The study highlights the challenges of identifying non-canonical peptides and provides an integrated workflow to address these challenges.

PROTEOMES (2022)

暂无数据