4.7 Article Proceedings Paper

Large scale analysis of protein stability in OMIM disease related human protein variants

Journal

BMC GENOMICS
Volume 17, Issue -, Pages -

Publisher

BIOMED CENTRAL LTD
DOI: 10.1186/s12864-016-2726-y

Keywords

Protein stability; Disease related-variations; Residue solvent accessibility; Interactomics networks

Ask authors/readers for more resources

Background: Modern genomic techniques allow to associate several Mendelian human diseases to single residue variations in different proteins. Molecular mechanisms explaining the relationship among genotype and phenotype are still under debate. Change of protein stability upon variation appears to assume a particular relevance in annotating whether a single residue substitution can or cannot be associated to a given disease. Thermodynamic properties of human proteins and of their disease related variants are lacking. In the present work, we take advantage of the available three dimensional structure of human proteins for predicting the role of disease related variations on the perturbation of protein stability. Results: We develop INPS3D, a new predictor based on protein structure for computing the effect of single residue variations on protein stability (Delta Delta G), scoring at the state-of-the-art (Pearson's correlation value of the regression is equal to 0.72 with mean standard error of 1.15 kcal/mol on a blind test set comprising 351 variations in 60 proteins). We then filter 368 OMIM disease related proteins known with atomic resolution (where the three dimensional structure covers at least 70 % of the sequence) with 4717 disease related single residue variations and 685 polymorphisms without clinical consequence. We find that the effect on protein stability of disease related variations is larger than the effect of polymorphisms: in particular, by setting to vertical bar 1 kcal/mol vertical bar the threshold between perturbing and not perturbing variations of the protein stability, about 44 % of disease related variations and 20 % of polymorphisms are predicted with vertical bar Delta Delta G vertical bar > 1 kcal/mol, respectively. A consistent fraction of OMIM disease related variations is however predicted to promote |Delta Delta G| <= 1 kcal/mol and we focus here on detecting features that can be associated to the thermodynamic property of the protein variant. Our analysis reveals that some 47 % of disease related variations promoting |Delta Delta G| <= 1 are located in solvent exposed sites of the protein structure. We also find that the increase of the fraction of variations that in proteins are predicted with vertical bar Delta Delta G vertical bar <= 1 kcal/mol, partially relates with the increasing number of the protein interacting partners, corroborating the notion that disease related, non-perturbing variations are likely to impair protein-protein interaction (70 % of the disease causing variations, with high accessible surface are indeed predicted in interacting sites). The set of OMIM surface accessible variations with vertical bar Delta Delta G vertical bar <= 1 kcal/mol and located in interaction sites are 23 % of the total in 161 proteins. Among these, 43 proteins with some 327 disease causing variations are involved in signalling, structural biological processes, development and differentiation. Conclusions: We compute the effect of disease causing variations on protein stability with INPS3D, a new state-of-the-art tool for predicting the change in Delta Delta G value associated to single residue substitution in protein structures. The analysis indicates that OMIM disease related variations in proteins promote a much larger effect on protein stability than polymorphisms non-associated to diseases. Disease related variations with a slight effect on protein stability (vertical bar Delta Delta G vertical bar < 1 kcal/mol) frequently occur at the protein accessible surface suggesting that they are located in protein-protein interactions patches in putative human biological functional networks. The hypothesis is corroborated by proving that proteins with many disease related variations that slightly perturb protein stability are on average more connected in the human physical interactome (IntAct) than proteins with variations predicted with vertical bar Delta Delta G vertical bar > 1 kcal/mol.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

A Glance into MTHFR Deficiency at a Molecular Level

Castrense Savojardo, Giulia Babbi, Davide Baldazzi, Pier Luigi Martelli, Rita Casadio

Summary: This study investigates the association between MTHFR deficiency and protein structure variations. Using computational tools, the researchers explored the properties of 72 disease-associated missense variations in the MTHFR protein. They found that 61% of the variations destabilized the protein and corresponded to known deficiencies. The study also revealed unique patterns of disease-associated variations in the protein architecture of MTHFR.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Review Chemistry, Multidisciplinary

Machine learning solutions for predicting protein-protein interactions

Rita Casadio, Pier Luigi Martelli, Castrense Savojardo

Summary: Proteins aggregates, known as biomolecular condensates, affect biological processes, and machine learning algorithms can be used to understand protein-protein interactions. More research is needed to improve our knowledge in this area.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE (2022)

Editorial Material Biochemistry & Molecular Biology

Molecular Effects of Mutations in Human Genetic Diseases

Emanuela Leonardi, Castrense Savojardo, Giovanni Minervini

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Astronomy & Astrophysics

Mouse genomic associations with in vitro sensitivity to simulated space radiation

Egle Cekanaviciute, Duc Tran, Hung Nguyen, Alejandra Lopez Macha, Eloise Pariset, Sasha Langley, Giulia Babbi, Sherina Malkani, Sebastien Penninckx, Jonathan C. Schisler, Tin Nguyen, Gary H. Karpen, Sylvain. V. Costes

Summary: By studying 15 strains of mice, we identified single nucleotide polymorphisms associated with spontaneous and ionizing radiation-induced DNA damage. Statistical analysis revealed an association with pathways related to cellular signaling, metabolism, tumorigenesis, and nervous system damage. Our findings provide new mouse models for further studying the impact of ionizing radiation and validating genetic loci.

LIFE SCIENCES IN SPACE RESEARCH (2023)

Article Biochemical Research Methods

Highly significant improvement of protein sequence alignments with AlphaFold2

Athanasios Baltzis, Leila Mansouri, Suzanne Jin, Bjorn E. Langer, Ionas Erb, Cedric Notredame, Pier Luigi Martelli

Summary: This study found that protein sequence alignments estimated on AlphaFold2 predictions are nearly as accurate as those estimated on experimental structures, and they are significantly closer to the structural reference than sequence-based alignments. Furthermore, even low-quality AlphaFold2 structural models can be used to obtain highly accurate alignments. This suggests that AlphaFold2 encodes higher-order dependencies that can be utilized for sequence analysis.

BIOINFORMATICS (2022)

Review Biochemical Research Methods

Quantum computing algorithms: getting closer to critical problems in computational biology

Laura Marchetti, Riccardo Nifosi, Pier Luigi Martelli, Eleonora Da Pozzo, Valentina Cappello, Francesco Banterle, Maria Letizia Trincavelli, Claudia Martini, Massimo D'Elia

Summary: Recent biotechnological progress has enabled life scientists and physicians to access an unprecedented amount of data at all levels of biological complexity. Quantum computing approaches hold the promise to resolve, speed up, or refine the analysis of a wide range of computational problems.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Multidisciplinary Sciences

Mapping human disease-associated enzymes into Reactome allows characterization of disease groups and their interactions

Castrense Savojardo, Davide Baldazzi, Giulia Babbi, Pier Luigi Martelli, Rita Casadio

Summary: Based on databases such as OMIM, Humsavar, Clinvar and Monarch, this study analyzes the association between human enzymes and genetic diseases, finding that 75% of them are rare diseases. By adopting the Mondo ontology initiative and Reactome pathways, a comprehensive understanding and categorization of the molecular mechanisms of diseases have been achieved.

SCIENTIFIC REPORTS (2022)

Article Biochemistry & Molecular Biology

Wood feeding and social living: Draft genome of the subterranean termite Reticulitermes lucifugus (Blattodea; Termitoidae)

Jacopo Martelossi, Giobbe Forni, Mariangela Iannello, Castrense Savojardo, Pier Luigi Martelli, Rita Casadio, Barbara Mantovani, Andrea Luchetti, Omar Rota-Stabelli

Summary: This study reports the draft genome of the subterranean termite Reticulitermes lucifugus, an economically important species with extensive research on its eusocial organization and mating system. The assembly covered a significant portion of the estimated genome size and showed complete homozygosity. The study identified numerous gene duplications associated with R. lucifugus diversification and significant lineage-specific expansions in gene families related to development, perception, and nutrient metabolism pathways. Additionally, a detailed analysis of P450 and odorant receptor gene repertoires revealed a wide diversity and dynamic evolutionary history. Overall, this newly assembled genome provides valuable insights into termite biology and pest control.

INSECT MOLECULAR BIOLOGY (2023)

Article Biochemistry & Molecular Biology

ISPRED-SEQ: Deep Neural Networks and Embeddings for Predicting Interaction Sites in Protein Sequences

Matteo Manfredi, Castrense Savojardo, Pier Luigi Martelli, Rita Casadio

Summary: This article introduces a method called ISPRED-SEQ for predicting protein-protein interaction sites based on protein sequences. It utilizes recently developed protein language models and deep neural networks, outperforming other similar methods.

JOURNAL OF MOLECULAR BIOLOGY (2023)

Article Biochemistry & Molecular Biology

MultifacetedProtDB: a database of human proteins with multiple functions

Elisa Bertolini, Giulia Babbi, Castrense Savojardo, Pier Luigi Martelli, Rita Casadio

Summary: MultiFacetedProtDB is a database of multifunctional human proteins, collecting comprehensive annotations and allowing searches for gene, protein, and associated structural and functional information. Studying multifunctional proteins helps elucidate the complexities of biological processes, particularly their importance and wide applications in human biology.

NUCLEIC ACIDS RESEARCH (2023)

Article Biochemical Research Methods

CoCoNat: a novel method based on deep learning for coiled-coil prediction

Giovanni Madeo, Castrense Savojardo, Matteo Manfredi, Pier Luigi Martelli, Rita Casadio

Summary: This article introduces a novel method called CoCoNat for predicting coiled-coil helix boundaries, residue-level register annotation, and oligomerization state. The method combines two state-of-the-art protein language models and implements a deep learning procedure with a Grammatical-Restrained Hidden Conditional Random Field. CoCoNat outperforms current state-of-the-art methods in residue-level and segment-level CCD, as well as in register annotation and prediction of oligomerization states.

BIOINFORMATICS (2023)

No Data Available