4.7 Article

Mass Fingerprinting of Complex Mixtures: Protein Inference from High-Resolution Peptide Masses and Predicted Retention Times

期刊

JOURNAL OF PROTEOME RESEARCH
卷 12, 期 12, 页码 5730-5741

出版社

AMER CHEMICAL SOC
DOI: 10.1021/pr400705q

关键词

bioinformatics; mass spectrometry; computational proteomics; shotgun proteomics; mass fingerprinting; retention time prediction

资金

  1. Swedish Research Council
  2. U.S. National Science Foundation (MRI) [0923536]
  3. American Recovery and Reinvestment Act (ARRA) funds [R01 HG005805]
  4. National Institute of General Medical Sciences/Center for Systems Biology [2P50 GM076547]
  5. Luxembourg Centre for Systems Biomedicine
  6. University of Luxembourg
  7. Div Of Biological Infrastructure
  8. Direct For Biological Sciences [0923536] Funding Source: National Science Foundation

向作者/读者索取更多资源

In typical shotgun experiments, the mass spectrometer records the masses of a large set of ionized analytes but fragments only a fraction of them. In the subsequent analyses, normally only the fragmented ions are used to compile a set of peptide identifications, while the unfragmented ones are disregarded. In this work, we show how the unfragmented ions, here denoted MS1-features, can be used to increase the confidence of the proteins identified in shotgun experiments. Specifically, we propose the usage of in silico mass tags, where the observed MS1-features are matched against de novo predicted masses and retention times for all peptides derived from a sequence database. We present a statistical model to assign protein-level probabilities based on the MS1-features and combine this data with the fragmentation spectra. Our approach was evaluated for two triplicate data sets from yeast and human, respectively, leading to up to 7% more protein identifications at a fixed protein-level false discovery rate of 1%. The additional protein identifications were validated both in the context of the mass spectrometry data and by examining their estimated transcript levels generated using RNA-Seq. The proposed method is reproducible, straightforward to apply, and can even be used to reanalyze and increase the yield of existing data sets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Cardiac & Cardiovascular Systems

Absence of full-length dystrophin impairs normal maturation and contraction of cardiomyocytes derived from human-induced pluripotent stem cells

J. Manuel Pioner, Xuan Guan, Jordan M. Klaiman, Alice W. Racca, Lil Pabon, Veronica Muskheli, Jesse Macadangdang, Cecilia Ferrantini, Michael R. Hoopmann, Robert L. Moritz, Deok-Ho Kim, Chiara Tesi, Corrado Poggesi, Charles E. Murry, Martin K. Childers, David L. Mack, Michael Regnier

CARDIOVASCULAR RESEARCH (2020)

Article Multidisciplinary Sciences

Focus on the spectra that matter by clustering of quantification data in shotgun proteomics

Matthew The, Lukas Kall

NATURE COMMUNICATIONS (2020)

Article Biochemical Research Methods

Performing Selection on a Monotonic Function in Lieu of Sorting Using Layer-Ordered Heaps

Kyle Lucke, Jake Pennington, Patrick Kreitzberg, Lukas Kall, Oliver Serang

Summary: Nonparametric statistical tests are essential in scientific experiments, but sorting becomes a bottleneck in computation. Selection and partitioning are usually performed on transformed statistical significance values due to the unknown threshold, necessitating data sorting. Layer-ordered heaps provide a solution by partially sorting values efficiently.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Biochemical Research Methods

SPACEPro: A Software Tool for Analysis of Protein Sample Cleavage for Tandem Mass Spectrometry

Vidur Kailash, Luis Mendoza, Robert L. Moritz, Michael R. Hoopmann

Summary: SPACEPro is a tool developed to evaluate the efficiency and reproducibility of protein cleavage in peptide samples, providing comprehensive information on sample processing to peptides at the PSM, peptide, and protein levels. It helps users understand the quality of sample processing efficiency and develop protocols for improvement with minimal effort.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Biochemical Research Methods

Triqler for MaxQuant: Enhancing Results from MaxQuant by Bayesian Error Propagation and Integration

Matthew The, Lukas Kall

Summary: Error estimation for differential protein quantification is challenging due to multiple error sources, but Triqler model combines them into one quantification error. The interface with MaxQuant allows quick reanalysis of processed data, showing superior performance in various data sets. Triqler and its interface are available as a Python module under Apache 2.0 license.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Microbiology

A redox-active crosslinker reveals an essential and inhibitable oxidative folding network in the endoplasmic reticulum of malaria parasites

David W. Cobb, Heather M. Kudyba, Alejandra Villegas, Michael R. Hoopmann, Rodrigo P. Baptista, Baylee Bruton, Michelle Krakowiak, Robert L. Moritz, Vasant Muralidharan

Summary: The study revealed the essential role of Thioredoxin (Trx) superfamily proteins in the endoplasmic reticulum (ER) of Plasmodium falciparum for parasite survival. By identifying a small molecule inhibitor targeting the redox activities of Trx-domain proteins, the study provides a new direction for future antimalarial drug development.

PLOS PATHOGENS (2021)

Editorial Material Biochemical Research Methods

2021 Special Issue on Software Tools and Resources: Finding the Right Tools for the Job

Susan T. Weintraub, Michael R. Hoopmann, Magnus Palmblad

JOURNAL OF PROTEOME RESEARCH (2021)

Article Biochemical Research Methods

Interpretation of the DOME Recommendations for Machine Learning in Proteomics and Metabolomics

Magnus Palmblad, Sebastian Boecker, Sven Degroeve, Oliver Kohlbacher, Lukas Kall, William Stafford Noble, Mathias Wilhelm

Summary: This article discusses the specific guidelines for conducting and reporting machine learning in proteomics and metabolomics, emphasizing the importance of describing molecular structure, function, and physicochemical properties.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Biochemical Research Methods

Putting Humpty Dumpty Back Together Again: What Does Protein Quantification Mean in Bottom-Up Proteomics?

Deanna L. Plubell, Lukas Kall, Bobbie-Jo Webb-Robertson, Lisa M. Bramer, Ashley Ives, Neil L. Kelleher, Lloyd M. Smith, Thomas J. Montine, Christine C. Wu, Michael J. MacCoss

Summary: Traditional proteomics methods aggregate peptide quantities into protein groups, potentially overlooking important biological differences between different components. Therefore, it is necessary to reconsider the practice of aggregating protein values to a single quantity in order to capture the contributions of existing proteoforms.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Biochemical Research Methods

Prosit Transformer: A transformer for Prediction of MS2 Spectrum Intensities

Markus Ekvall, Patrick Truong, Wassim Gabriel, Mathias Wilhelm, Lukas Kall

Summary: Machine learning has long been essential in interpreting proteomics data from mass spectrometry. Recently, the Transformer model, successful in other fields of bioinformatics, has been applied with the convenience of transfer learning. In this study, a Transformer based on the TAPE model was implemented to predict MS2 intensities, outperforming the traditional recurrent neural network-based predictor Prosit.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Biochemical Research Methods

Survival analysis of pathway activity as a prognostic determinant in breast cancer

Gustavo S. Jeuken, Nicholas P. Tobin, Lukas Kall

Summary: High throughput biology allows for the measurement of relative concentrations of biomolecules from tissue samples, but interpreting the resulting differences between samples remains a challenge. Pathway analysis provides a way to study these differences in terms of altered pathway activity. In this study, we demonstrate the use of a method for pathway-based survival analysis, using the METABRIC dataset. Our findings show that pathway activities can serve as better prognostic markers for survival than individual transcripts.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Biochemical Research Methods

Triqler for Protein Summarization of Data from Data-Independent Acquisition Mass Spectrometry

Patrick Truong, Matthew The, Lukas Kall

Summary: A common goal in quantitative shotgun proteomics experiments is to identify proteins that show differential abundance under specific experimental conditions. However, this is challenging as mass spectrometry analyzes protein fragments rather than intact proteins. In this study, we demonstrate that a Bayesian hierarchical probabilistic model called Triqler can be adapted for data-independent acquisition (DIA) data after making minor changes to account for missing values. Furthermore, we show that Triqler outperforms other state-of-the-art protein summarization tools when evaluated on DIA data.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemical Research Methods

Toward an Integrated Machine Learning Model of a Proteomics Experiment

Benjamin A. Neely, Viktoria Dorfer, Lennart Martens, Isabell Bludau, Robbin Bouwmeester, Sven Degroeve, Eric W. Deutsch, Siegfried Gessulat, Lukas Kaell, Pawel Palczynski, Samuel H. Payne, Tobias Greisager Rehfeldt, Tobias Schmidt, Veit Schwaemmle, Julian Uszkoreit, Juan Antonio Vizcaino, Mathias Wilhelm, Magnus Palmblad

Summary: In recent years, machine learning has made significant progress in modeling mass spectrometry data for proteomics analysis. A workshop was conducted to evaluate and explore machine learning applications in multidimensional mass spectrometry-based proteomics analysis. The workshop helped identify knowledge gaps, define needs, and discuss the possibilities, challenges, and future opportunities. The summary of the discussions conveys excitement about the potential of machine learning in proteomics and aims to inspire future research.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Microbiology

Roles of bacteriophages, plasmids and CRISPR immunity in microbial community dynamics revealed using time-series integrated meta-omics

Susana Martinez Arbas, Shaman Narayanasamy, Malte Herold, Laura A. Lebrun, Michael R. Hoopmann, Sujun Li, Tony J. Lam, Benoit J. Kunath, Nathan D. Hicks, Cindy M. Liu, Lance B. Price, Cedric C. Laczny, John D. Gillece, James M. Schupp, Paul S. Keim, Robert L. Moritz, Karoline Faust, Haixu Tang, Yuzhen Ye, Alexander Skupin, Patrick May, Emilie E. L. Muller, Paul Wilmes

Summary: The study found that CRISPR plays a more important role in defense against plasmids than phages, and the variability in plasmid abundance over time has a greater impact on community dynamics than phages. In addition, the enrichment of protospacers targeting genes involved in the transmission of iMGEs helps understand population fitness and devise control strategies.

NATURE MICROBIOLOGY (2021)

Article Biochemical Research Methods

Insights from the First Phosphopeptide Challenge of the MS Resource Pillar of the HUPO Human Proteome Project

Michael R. Hoopmann, Ulrike Kusebauch, Magnus Palmblad, Nuno Bandeira, David D. Shteynberg, Lingjie He, Bin Xia, Stoyan H. Stoychev, Gilbert S. Omenn, Susan T. Weintraub, Robert L. Moritz

JOURNAL OF PROTEOME RESEARCH (2020)

暂无数据