4.6 Article

Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction

期刊

BMC BIOINFORMATICS
卷 13, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-13-235

关键词

Coevolution; Mutual information; Specificity determining position; Catalytic residues; Functional sites; Sequence analysis

资金

  1. Lounsbery Foundation
  2. National Library of Medicine of the Gulf Coast Consortia (NLM) [5T15LM07093]
  3. National Research Council of Argentina (CONICET)

向作者/读者索取更多资源

Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap. Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system. Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Immunology

Accurate MHC Motif Deconvolution of Immunopeptidomics Data Reveals a Significant Contribution of DRB3, 4 and 5 to the Total DR Immunopeptidome

Saghar Kaabinejadian, Carolina Barra, Bruno Alvarez, Hooman Yari, William H. Hildebrand, Morten Nielsen

Summary: Mass spectrometry-based immunopeptidomics is an important technique in biomedical applications, but the complexity of the data and the lack of appropriate tools have hindered its large-scale application. In this study, a new tool called MHCMotifDecon is presented, which accurately deconvolutes immunopeptidome datasets and helps identify and characterize HLA binding motifs, thus facilitating the discovery of new T cell targets.

FRONTIERS IN IMMUNOLOGY (2022)

Article Multidisciplinary Sciences

Combined assessment of MHC binding and antigen abundance improves T cell epitope predictions

Zeynep Kosaloglu-Yalcin, Jenny Lee, Jason Greenbaum, Stephen P. Schoenberger, Aaron Miller, Young J. Kim, Alessandro Sette, Morten Nielsen, Bjoern Peters

Summary: The prediction accuracy of antigen epitopes can be further improved by considering the abundance levels of peptides' source proteins. By incorporating biophysical principles, existing MHC binding prediction tools, and abundance estimates of source proteins, a function was derived to estimate the likelihood of a peptide to be an MHC class I ligand. The use of proteomic data showed the highest performance in improving epitope predictions.

ISCIENCE (2022)

Review Biochemical Research Methods

The interdependence of machine learning and LC-MS approaches for an unbiased understanding of the cellular immunopeptidome

Morten Nielsen, Nicola Ternette, Carolina Barra

Summary: This article discusses the concept, applications, and challenges of immunopeptidome and emphasizes the benefits and limitations of liquid chromatography-tandem mass spectrometry (MS) in obtaining large-scale immunopeptidome data sets. It highlights the importance of refined and highly optimized machine learning approaches for accurate analysis and interpretation of the data. Furthermore, it showcases the use of MS-immunopeptidomics data in improving the accuracy of immunoinformatics prediction methods and demonstrates the synergistic combination of MS experiments and in silico models for optimal antigen discovery.

EXPERT REVIEW OF PROTEOMICS (2022)

Article Biochemistry & Molecular Biology

NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning

Magnus Haraldson Hoie, Erik Nicolas Kiehl, Bent Petersen, Morten Nielsen, Ole Winther, Henrik Nielsen, Jeppe Hallgren, Paolo Marcatili

Summary: Recent advances in machine learning and natural language processing have enabled accurate prediction of protein structures and functions, with NetSurfP-3.0 standing out as a tool with drastically improved runtime and reliable prediction performance.

NUCLEIC ACIDS RESEARCH (2022)

Article Multidisciplinary Sciences

Neoantigen-specific CD8 T cell responses in the peripheral blood following PD-L1 blockade might predict therapy outcome in metastatic urothelial carcinoma

Jeppe Sejero Holm, Samuel A. Funt, Annie Borch, Kamilla Kjaergaard Munk, Anne-Mette Bjerregaard, James L. Reading, Colleen Maher, Ashley Regazzi, Phillip Wong, Hikmat Al-Ahmadie, Gopa Iyer, Tripti Tamhane, Amalie Kai Bentzen, Nana Overgaard Herschend, Susan De Wolf, Alexandra Snyder, Taha Merghoub, Jedd D. Wolchok, Morten Nielsen, Jonathan E. Rosenberg, Dean F. Bajorin, Sine Reker Hadrup

Summary: This study demonstrates that the expansion of neoantigen-specific CD8(+) T cells can distinguish between patients with controlled disease and progressive disease in metastatic urothelial carcinoma treated with PD-L1 blockade. Furthermore, the peripheral NARTs derived from patients with disease control exhibit specific cell phenotypes and increased CD39 levels, suggesting their association with treatment response.

NATURE COMMUNICATIONS (2022)

Article Virology

Tracking SARS-CoV-2 mutations and variants through the COG-UK-Mutation Explorer

Derek W. Wright, William T. Harvey, Joseph Hughes, MacGregor Cox, Thomas P. Peacock, Rachel Colquhoun, Ben Jackson, Richard Orton, Morten Nielsen, Nienyun Sharon Hsu, Ewan M. Harrison, Thushan de Silva, Andrew Rambaut, Sharon J. Peacock, David L. Robertson, Alessandro M. Carabelli

Summary: COG-UK Mutation Explorer is a web resource that provides knowledge and analysis on SARS-CoV-2 virus genome mutations and variants in the UK. It focuses on antigenic amino acid replacements that have immunological significance. The resource curates data from over 2 million genome sequences and cross-references them with experimental data from the literature. It tracks mutations that could impact the neutralizing activity of antibodies and vaccines, as well as changes in T cell epitopes and resistance to antiviral drugs.

VIRUS EVOLUTION (2022)

Article Biochemistry & Molecular Biology

Defective Proinsulin Handling Modulates the MHC I Bound Peptidome and Activates the Inflammasome in β-Cells

Muhammad Saad Khilji, Pouya Faridi, Erika Pinheiro-Machado, Carolin Hoefner, Tina Dahlby, Ritchlynn Aranha, Soren Buus, Morten Nielsen, Justyna Klusek, Thomas Mandrup-Poulsen, Kirti Pandey, Anthony W. Purcell, Michal T. Marzec

Summary: The loss of GRP94 from the endoplasmic reticulum results in mishandling of proinsulin, ER stress, and activation of the immunoproteasome, leading to the sensitization of beta-cells to immune attack.

BIOMEDICINES (2022)

Article Biochemical Research Methods

A comprehensive analysis of the IEDB MHC class-I automated benchmark

Raphael Trevizani, Zhen Yan, Jason A. Greenbaum, Alessandro Sette, Morten Nielsen, Bjoern Peters

Summary: An approach to assess the reliability of different metrics for evaluating the performance of MHC class I binding predictors was developed. The study found that using percentile-ranked results improved the stability of the ranks and identified the top-performing tools in the benchmark.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

The Cancer Epitope Database and Analysis Resource (CEDAR)

Zeynep Kosaloglu-Yalcin, Nina Blazeska, Randi Vita, Hannah Carter, Morten Nielsen, Stephen Schoenberger, Alessandro Sette, Bjoern Peters

Summary: CEDAR is a freely accessible database and analysis resource for cancer epitopes, which are molecular targets recognized by anti-cancer immune cells. Detailed knowledge of cancer epitopes is crucial for understanding and planning cancer prevention, treatment, and immune responses.

NUCLEIC ACIDS RESEARCH (2023)

Article Multidisciplinary Sciences

The role of antigen expression in shaping the repertoire of HLA presented ligands

Heli M. Garcia Alvarez, Zeynep Kosaloglu-Yalcin, Bjoern Peters, Morten Nielsen

Summary: This study finds that the prediction performance of HLA antigen presentation can be improved by integrating information on antigen abundance, which has important implications for immunotherapy and vaccine design.

ISCIENCE (2022)

Article Oncology

Thiopurine 6TG treatment increases tumor immunogenicity and response to immune checkpoint blockade

Loulieta Nazerai, Shona Caroline Willis, Patricio Yankilevich, Luca Di Leo, Francesca Maria Bosisio, Alex Frias, Corine Bertolotto, Jacob Nersting, Maria Thastrup, Soren Buus, Allan Randrup Thomsen, Morten Nielsen, Kristoffer Staal Rohrberg, Kjeld Schmiegelow, Daniela De Zio

Summary: This study used the drug 6TG to induce mutations in tumor cells and increase the level of neoepitopes, enhancing the immune response. 6TG exposure increased tumor mutational burden and reshaped the tumor microenvironment, making the tumors more responsive to immune-checkpoint blockade.

ONCOIMMUNOLOGY (2023)

Article Biology

Improved T cell receptor antigen pairing through data-driven filtering of sequencing information from single cells

Helle Rus Povlsen, Amalie Kai Bentzen, Mohammad Kadivar, Leon Eyrich Jessen, Sine Reker Hadrup, Morten Nielsen, K. Christopher Garcia

Summary: Novel single-cell-based technologies enable high-throughput matching of T cell receptor (TCR) sequences with their cognate peptide-MHC recognition motif. A data-driven method called ITRAP is proposed to filter out likely artifacts and generate large sets of TCR-pMHC sequence data with high specificity and sensitivity. This approach has been validated in virus-specific T cell responses across multiple healthy donors.
Article Biology

Machine learning reveals limited contribution of trans-only encoded variants to the HLA-DQ immunopeptidome

Jonas Birkelund Nilsson, Saghar Kaabinejadian, Hooman Yari, Bjoern Peters, Carolina Barra, Loren Gragert, William Hildebrand, Morten Nielsen

Summary: This study addresses the prediction of HLA-DQ antigen presentation and the contribution of trans-only variants in shaping the HLA-DQ immunopeptidome. By integrating immunoinformatics data mining models with mass spectrometry immunopeptidomics data, the study demonstrates improved predictive power and molecular coverage for models trained with novel HLA-DQ data. The study also reveals the limited contribution of trans-only HLA-DQ variants to the overall HLA-DQ immunopeptidome.

COMMUNICATIONS BIOLOGY (2023)

Article Immunology

The structure of songbird MHC class I reveals antigen binding that is flexible at the N-terminus and static at the C-terminus

Sandra Eltschkner, Samantha Mellinger, Soren Buus, Morten Nielsen, Kajsa M. M. Paulsson, Karin Lindkvist-Petersson, Helena Westerdahl

Summary: Long-distance migratory animals such as birds and bats have evolved a unique adaptive immunity with highly duplicated Major Histocompatibility Complex (MHC) genes to withstand diverse pathogens. A study on the MHC class I protein, Acar3, from the great reed warbler reveals a peculiar peptide-binding mode that potentially facilitates interactions with innate immune receptors. The investigation highlights the importance of studying the immune system of wild animals to uncover unique immune mechanisms absent in humans and model organisms.

FRONTIERS IN IMMUNOLOGY (2023)

Article Mathematical & Computational Biology

NetAllergen, a random forest model integrating MHC-II presentation propensity for improved allergenicity prediction

Yuchen Li, Peter Wad Sackett, Morten Nielsen, Carolina Barra

Summary: This article introduces a new protein allergenicity prediction method, which introduces MHC presentation propensity as a novel feature to overcome the limitations of previous methods in accurately predicting allergenicity when similarity diminishes.

BIOINFORMATICS ADVANCES (2023)

暂无数据