4.6 Article

The Drosophila melanogaster PeptideAtlas facilitates the use of peptide data for improved fly proteomics and genome annotation

期刊

BMC BIOINFORMATICS
卷 10, 期 -, 页码 -

出版社

BIOMED CENTRAL LTD
DOI: 10.1186/1471-2105-10-59

关键词

-

资金

  1. University Research Priority Program Systems Biology/Functional Genomics of the University of Zurich
  2. University of Zurich
  3. EMBO fellowship [127.00-2007)]
  4. National Science Foundation
  5. National Heart, Lung, and Blood Institute of the NIH [N01-HV-28179]

向作者/读者索取更多资源

Background: Crucial foundations of any quantitative systems biology experiment are correct genome and proteome annotations. Protein databases compiled from high quality empirical protein identifications that are in turn based on correct gene models increase the correctness, sensitivity, and quantitative accuracy of systems biology genome-scale experiments. Results: In this manuscript, we present the Drosophila melanogaster PeptideAtlas, a fly proteomics and genomics resource of unsurpassed depth. Based on peptide mass spectrometry data collected in our laboratory the portal http://www.drosophila-peptideatlas.org allows querying fly protein data observed with respect to gene model confirmation and splice site verification as well as for the identification of proteotypic peptides suited for targeted proteomics studies. Additionally, the database provides consensus mass spectra for observed peptides along with qualitative and quantitative information about the number of observations of a particular peptide and the sample(s) in which it was observed. Conclusion: PeptideAtlas is an open access database for the Drosophila community that has several features and applications that support (1) reduction of the complexity inherently associated with performing targeted proteomic studies, (2) designing and accelerating shotgun proteomics experiments, (3) confirming or questioning gene models, and (4) adjusting gene models such that they are in line with observed Drosophila peptides. While the database consists of proteomic data it is not required that the user is a proteomics expert.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Genetics & Heredity

The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources

Marina T. DiStefano, Scott Goehringer, Lawrence Babb, Fowzan S. Alkuraya, Joanna Amberger, Mutaz Amin, Christina Austin-Tse, Marie Balzotti, Jonathan S. Berg, Ewan Birney, Carol Bocchini, Elspeth A. Bruford, Alison J. Coffey, Heather Collins, Fiona Cunningham, Louise C. Daugherty, Yaron Einhorn, Helen Firth, David R. Fitzpatrick, Rebecca E. Foulger, Jennifer Goldstein, Ada Hamosh, Matthew R. Hurles, Sarah E. Leigh, Ivone U. S. Leong, Sateesh Maddirevula, Christa L. Martin, Ellen M. McDonagh, Annie Olry, Arina Puzriakova, Kelly Radtke, Erin M. Ramos, Ana Rath, Erin Rooney Riggs, Angharad M. Roberts, Charlotte Rodwell, Catherine Snow, Zornitza Stark, Jackie Tahiliani, Susan Tweedie, James S. Ware, Phillip Weller, Eleanor Williams, Caroline F. Wright, Thabo Michael Yates, Heidi L. Rehm

Summary: This study addresses the lack of universal standards and terminologies in defining gene-disease relationships. The Gene Curation Coalition (GenCC) was formed to establish harmonized definitions and develop a unified database. The results show that conflicts in gene-disease validity assertions exist, highlighting the importance of standardization and collaboration in genetic testing and variant interpretation.

GENETICS IN MEDICINE (2022)

Article Biochemical Research Methods

Proteomics Standards Initiative's ProForma 2.0: Unifying the Encoding of Proteoforms and Peptidoforms

Richard D. LeDuc, Eric W. Deutsch, Pierre-Alain Binz, Ryan T. Fellers, Anthony J. Cesnik, Joshua A. Klein, Tim Van den Bossche, Ralf Gabriels, Arshika Yalavarthi, Yasset Perez-Riverol, Jeremy Carver, Wout Bittremieux, Shin Kawano, Benjamin Pullman, Nuno Bandeira, Neil L. Kelleher, Paul M. Thomas, Juan Antonio Vizcaino

Summary: It is important for the proteomics community to have a standardized manner to represent all possible variations of a protein or peptide primary sequence. The ProForma 2.0 notation aims to unify the representation of proteoforms and peptidoforms, supporting different proteomics approaches and allowing the encoding of highly modified proteins and peptides using a human- and machine-readable string.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Biochemical Research Methods

Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation

Anton Kalyuzhnyy, Patrick A. Eyers, Claire E. Eyers, Emily Bowler-Barnett, Maria J. Martin, Zhi Sun, Eric W. Deutsch, Andrew R. Jones

Summary: This study profiles the human phosphoproteome to estimate the false discovery rate of phosphosites and predict a more realistic count of true identifications. It highlights significant potential of false-positive data in phosphorylation databases and emphasizes the importance of quality control for such data.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Biochemical Research Methods

Method for Independent Estimation of the False Localization Rate for Phosphoproteomics

Kerry A. Ramsbottom, Ananth Prakash, Yasset Perez Riverol, Oscar Martin Camacho, Maria-Jesus Martin, Juan Antonio Vizcaino, Eric W. Deutsch, Andrew R. Jones

Summary: Phosphoproteomic methods are used to identify and quantify phosphorylation sites on proteins. This study introduces the concept of scoring modifications on a decoy amino acid to estimate the global false localization rate (FLR) independently.

JOURNAL OF PROTEOME RESEARCH (2022)

Article Genetics & Heredity

FlyBase: a guided tour of highlighted features

L. Sian Gramates, Julie Agapite, Helen Attrill, Brian R. Calvi, Madeline A. Crosby, Gilberto dos Santos, Joshua L. Goodman, Damien Goutte-Gattat, Victoria K. Jenkins, Thomas Kaufman, Aoife Larkin, Beverley B. Matthews, Gillian Millburn, Victor B. Strelets

Summary: As FlyBase celebrates its fourth decade, it highlights its unique aspects and expresses its commitment to collaborate with other research communities. They emphasize the reports and tools dedicated to fly researchers' needs and provide multiple avenues for researchers to interact with FlyBase.

GENETICS (2022)

Article Genetics & Heredity

Harmonizing model organism data in the Alliance of Genome Resources

Paul W. Sternberg

Summary: The Alliance of Genome Resources is a collaborative effort of 7 knowledgebase projects aimed at providing better service, a harmonized view of data, and a sustainable infrastructure. It offers comparative views of gene function, gene expression, and human disease relevance. The information is presented on uniform gene pages for easy summarization.

GENETICS (2022)

Article Biochemistry & Molecular Biology

A standardised nomenclature for long non-coding RNAs

Ruth L. Seal, Susan Tweedie, Elspeth A. Bruford

Summary: The HGNC is the only organization authorized to approve symbols for human genes, ensuring clarity and ease of search in scientific literature and databases.

IUBMB LIFE (2023)

Article Biochemistry & Molecular Biology

Genenames.org: the HGNC resources in 2023

Ruth L. Seal, Bryony Braschi, Kristian Gray, Tamsin E. M. Jones, Susan Tweedie, Liora Haim-Vilmovsky, Elspeth A. Bruford

Summary: The HUGO Gene Nomenclature Committee (HGNC) is responsible for assigning unique symbols and names to human genes. Their database contains over 43,000 approved gene symbols, including protein-coding genes, pseudogenes, and non-coding RNA genes, which are displayed on their public website along with links to genomic, clinical, and proteomic information.

NUCLEIC ACIDS RESEARCH (2023)

Article Biochemical Research Methods

Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work

Eric W. Deutsch, Juan Antonio Vizcaino, Andrew R. Jones, Pierre-Alain Binz, Henry Lam, Joshua Klein, Wout Bittremieux, Yasset Perez-Riverol, David L. Tabb, Mathias Walzer, Sylvie Ricard-Blum, Henning Hermjakob, Steffen Neumann, Tytus D. Mak, Shin Kawano, Luis Mendoza, Tim Van Den Bossche, Ralf Gabriels, Nuno Bandeira, Jeremy Carver, Benjamin Pullman, Zhi Sun, Nils Hoffmann, Jim Shofstahl, Yunping Zhu, Luana Licata, Federica Quaglia, Silvio C. E. Tosatto, Sandra E. Orchard

Summary: The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been developing guidelines and standards for the proteomics community for 20 years. The organization describes its operation, the current state of existing standards, and the proposals currently being developed. They emphasize the importance of community participation and collaboration with other organizations to promote data sharing and accelerate progress in proteomics.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemical Research Methods

ProteomicsML: An Online Platform for Community-Curated Data sets and Tutorials for Machine Learning in Proteomics

Tobias G. Rehfeldt, Ralf Gabriels, Robbin Bouwmeester, Siegfried Gessulat, Benjamin A. Neely, Magnus Palmblad, Yasset Perez-Riverol, Tobias Schmidt, Juan Antonio Vizcaino, Eric W. Deutsch

Summary: Data set acquisition and curation are challenging in machine learning, particularly for proteomics-based LC-MS data sets due to data reduction. ProteomicsML is introduced as an online resource for accessing proteomics-based data sets and tutorials. It simplifies data access and provides tutorials for interacting with advanced algorithms. ProteomicsML enables comparison of machine learning algorithms and offers introductory material for newcomers in the field. The platform is freely available at https://www.proteomicsml.org/, and contributions are welcome at https://github.com/ProteomicsML/ProteomicsML.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemical Research Methods

Trans-Proteomic Pipeline: Robust Mass Spectrometry-Based Proteomics Data Analysis Suite

Eric W. Deutsch, Luis Mendoza, David D. Shteynberg, Michael R. Hoopmann, Zhi Sun, Jimmy K. Eng, Robert L. Moritz

Summary: The Trans-Proteomic Pipeline (TPP) mass spectrometry data analysis suite has undergone continuous development and refinement since the publication of its first tools, PeptideProphet and ProteinProphet, 20 years ago. The latest release offers a wide range of tools for spectrum processing, spectrum searching, search validation, abundance computation, protein inference, and more. These tools incorporate machine-learning modeling to maximize information extraction and establish robust statistical models for accurate information derivation. This article presents an overview of the latest TPP tools, their deployment on different platforms, tutorials for their usage, and synergistic projects leveraging TPP. The article concludes with outlined plans for future TPP development.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemical Research Methods

Toward an Integrated Machine Learning Model of a Proteomics Experiment

Benjamin A. Neely, Viktoria Dorfer, Lennart Martens, Isabell Bludau, Robbin Bouwmeester, Sven Degroeve, Eric W. Deutsch, Siegfried Gessulat, Lukas Kaell, Pawel Palczynski, Samuel H. Payne, Tobias Greisager Rehfeldt, Tobias Schmidt, Veit Schwaemmle, Julian Uszkoreit, Juan Antonio Vizcaino, Mathias Wilhelm, Magnus Palmblad

Summary: In recent years, machine learning has made significant progress in modeling mass spectrometry data for proteomics analysis. A workshop was conducted to evaluate and explore machine learning applications in multidimensional mass spectrometry-based proteomics analysis. The workshop helped identify knowledge gaps, define needs, and discuss the possibilities, challenges, and future opportunities. The summary of the discussions conveys excitement about the potential of machine learning in proteomics and aims to inspire future research.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemical Research Methods

Does the Ubiquitination Degradation Pathway Really Reach inside of the Chloroplast? A Re-Evaluation of Mass Spectrometry-Based Assignments of Ubiquitination

Klaas J. van Wijk, Tami Leppert, Zhi Sun, Eric W. Deutsch

Summary: A recent paper in Science Advances suggests that intrachloroplast proteins in Arabidopsis can be polyubiquitinated and transferred to the cytosol for degradation, challenging the existing belief that chloroplast organelles lack polyubiquitination mechanisms. However, a reanalysis of the raw mass spectrometry data revealed discrepancies and issues with the results, including extensive artifactual modifications caused by the use of alkylating agent iodoacetamide. As a result, the conclusions regarding intra-chloroplast polyubiquitination cannot be supported by these data. Recommendations for future research are provided to avoid similar pitfalls.

JOURNAL OF PROTEOME RESEARCH (2023)

Article Biochemistry & Molecular Biology

PANGEA: a new gene set enrichment tool for Drosophila and common research organisms

Yanhui Hu, Aram Comjean, Helen Attrill, Giulia Antonazzo, Jim Thurmond, Weihang Chen, Fangge Li, Tiffany Chao, Stephanie E. Mohr, Nicholas H. Brown, Norbert Perrimon

Summary: PANGEA is a new GSEA tool that allows flexible and configurable data analysis using various classification sets. It enables GO analysis on different sets of GO annotations and provides visualization of results and comparison of multiple input gene lists.

NUCLEIC ACIDS RESEARCH (2023)

Article Plant Sciences

Detection and editing of the updated Arabidopsis plastid- and mitochondrial-encoded proteomes through PeptideAtlas

Klaas J. van Wijk, Stephane Bentolila, Tami Leppert, Qi Sun, Zhi Sun, Luis Mendoza, Margaret Li, Eric W. Deutsch

Summary: This study investigates the impact of RNA editing on the protein accumulation of organelle-encoded proteins in Arabidopsis. It identifies the nonredundant set of protein identifiers and predicted editing sites, and determines the accumulation of edited and unedited proteoforms at the protein level. The results show that major RNA editing sites are required for stable protein accumulation.

PLANT PHYSIOLOGY (2023)

暂无数据