4.7 Article

Identification of subfamily-specific sites based on active sites modeling and clustering

Ask authors/readers for more resources

Motivation: Current computational approaches to function prediction are mostly based on protein sequence classification and transfer of annotation from known proteins to their closest homologous sequences relying on the orthology concept of function conservation. This approach suffers a major weakness: annotation reliability depends on global sequence similarity to known proteins and is poorly efficient for enzyme superfamilies that catalyze different reactions. Structural biology offers a different strategy to overcome the problem of annotation by adding information about protein 3D structures. This information can be used to identify amino acids located in active sites, focusing on detection of functional polymorphisms residues in an enzyme superfamily. Structural genomics programs are providing more and more novel protein structures at a high-throughput rate. However, there is still a huge gap between the number of sequences and available structures. Computational methods, such as homology modeling provides reliable approaches to bridge this gap and could be a new precise tool to annotate protein functions. Results: Here, we present Active Sites Modeling and Clustering (ASMC) method, a novel unsupervised method to classify sequences using structural information of protein pockets. ASMC combines homology modeling of family members, structural alignment of modeled active sites and a subsequent hierarchical conceptual classification. Comparison of profiles obtained from computed clusters allows the identification of residues correlated to subfamily function divergence, called specificity determining positions. ASMC method has been validated on a benchmark of 42 Pfam families for which previous resolved holo-structures were available. ASMC was also applied to several families containing known protein structures and comprehensive functional annotations. We will discuss how ASMC improves annotation and understanding of protein families functions by giving some specific illustrative examples on nucleotidyl cyclases, protein kinases and serine proteases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

Introducing Programming Skills for Life Science Students

Diego Mariano, Pedro Martins, Lucianna Helene Santos, Raquel Cardoso de Melo-Minardi

BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION (2019)

Article Biochemistry & Molecular Biology

Molecular Dynamics Gives New Insights into the Glucose Tolerance and Inhibition Mechanisms on β-Glucosidases

Leon Sulfierry Correa Costa, Diego Cesar Batista Mariano, Rafael Eduardo Oliveira Rocha, Johannes Kraml, Carlos Henrique da Silveira, Klaus Roman Liedl, Raquel Cardoso de Melo-Minardi, Leonardo Henrique Franca de Lima

MOLECULES (2019)

Article Biochemical Research Methods

visGReMLIN: graph mining-based detection and visualization of conserved motifs at 3D protein-ligand interface at the atomic level

Vagner S. Ribeiro, Charles A. Santana, Alexandre V. Fassio, Fabio R. Cerqueira, Carlos H. da Silveira, Joao P. R. Romanelli, Adriana Patarroyo-Vargas, Maria G. A. Oliveira, Valdete Goncalves-Almeida, Sandro C. Izidoro, Raquel C. de Melo-Minardi, Sabrina de A. Silveira

BMC BIOINFORMATICS (2020)

Article Biochemical Research Methods

Proteus: An algorithm for proposing stabilizing mutation pairs based on interactions observed in known protein 3D structures

Jose Renato M. S. Barroso, Diego Mariano, Sandro R. Dias, Rafael E. O. Rocha, Lucianna H. Santos, Ronaldo A. P. Nagem, Raquel C. de Melo-Minardi

BMC BIOINFORMATICS (2020)

Article Cell Biology

Glutantβase: a database for improving the rational design of glucose-tolerant β-glucosidases

Diego Mariano, Naiara Pantuza, Lucianna H. Santos, Rafael E. O. Rocha, Leonardo H. F. de Lima, Lucas Bleicher, Raquel Cardoso de Melo-Minardi

BMC MOLECULAR AND CELL BIOLOGY (2020)

Review Biochemistry & Molecular Biology

Structural and Functional Aspects of Targeting the Secreted Human Group IIA Phospholipase A(2)

Ryung Rae Kim, Zheng Chen, Timothy J. Mann, Karine Bastard, Kieran F. Scott, W. Bret Church

MOLECULES (2020)

Article Biochemical Research Methods

GRaSP: a graph-based residue neighborhood strategy to predict binding sites

Charles A. Santana, Sabrina de A. Silveira, Joao P. A. Moraes, Sandro C. Izidoro, Raquel C. de Melo-Minardi, Antonio J. M. Ribeiro, Jonathan D. Tyzack, Neera Borkakoti, Janet M. Thornton

BIOINFORMATICS (2020)

Article Biochemical Research Methods

Propedia: a database for protein-peptide identification based on a hybrid clustering algorithm

Pedro M. Martins, Lucianna H. Santos, Diego Mariano, Felippe C. Queiroz, Luana L. Bastos, Isabela de S. Gomes, Pedro H. C. Fischer, Rafael E. O. Rocha, Sabrina A. Silveira, Leonardo H. F. de Lima, Mariana T. Q. de Magalhaes, Maria G. A. Oliveira, Raquel C. de Melo-Minardi

Summary: Propedia is a comprehensive database that contains over 19,000 high-resolution protein-peptide complex structures, allowing for multi-faceted analysis and comparison of peptides through a hybrid clustering algorithm, supporting rational peptide design and research.

BMC BIOINFORMATICS (2021)

Article Oncology

Functional significance of germline EPAS1 variants

Trisha Dwight, Edward Kim, Karine Bastard, Diana E. Benn, Graeme Eisenhofer, Susan Richter, Massimo Mannelli, Elena Rapizzi, Aleksander Prejbisz, Mariola Peczkowska, Karel Pacak, Roderick Clifton-Bligh

Summary: Somatic mutations in EPAS1 are associated with various phenotypes, while the pathogenic potential of germline variants remains unclear. Functional studies showed that three germline variants of EPAS1 have similarities to somatic activating mutations in PPGL, suggesting they may act as modifiers in the expression of PPGL.

ENDOCRINE-RELATED CANCER (2021)

Article Biology

Protein structural bioinformatics: An overview

Vinicius de Almeida Paiva, Isabela de Souza Gomes, Cleiton Rodrigues Monteiro, Murillo Ventura Mendonca, Pedro Magalhaes Martins, Charles Abreu Santana, Valdete Goncalves-Almeida, Sandro Carvalho Izidoro, Raquel Cardoso de Melo-Minardi, Sabrina de Azevedo Silveira

Summary: Proteins play crucial roles in organisms, performing various functions in cells. Resources for protein structural bioinformatics, such as modeling, docking, dynamics, interaction, and mutation analysis, are usually scattered across online repositories, making it challenging for students and professionals to know which resources to use and where to find them. This paper introduces the main subareas of protein structural bioinformatics and discusses several online resources, aiming to provide an overview of this research field and serve as a starting point for those interested in building competencies in this area.

COMPUTERS IN BIOLOGY AND MEDICINE (2022)

Article Chemistry, Medicinal

Prioritizing Virtual Screening with Interpretable Interaction Fingerprints

Alexandre Fassio, Laura Shub, Luca Ponzoni, Jessica McKinley, Matthew J. O'Meara, Rafaela S. Ferreira, Michael J. Keiser, Raquel C. de Melo Minardi

Summary: This paper introduces a machine learning-based drug discovery method that utilizes the LUNA toolkit to calculate and encode protein-ligand interactions into new fingerprints. The method also provides visual strategies for interpretable fingerprints. Experimental results show that this method outperforms traditional fingerprints in reproducing scores and identifying similarities. Therefore, LUNA and its interface fingerprints are promising approaches for machine learning-based drug discovery.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Proceedings Paper Computer Science, Interdisciplinary Applications

Computational Methodology for Discovery of Potential Inhibitory Peptides

Vivian Morais Paixao, Raquel C. de Melo Minardi

ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2022 (2022)

Proceedings Paper Computer Science, Interdisciplinary Applications

Identifying Large Scale Conformational Changes in Proteins Through Distance Maps and Convolutional Networks

Lucas Moraes dos Santos, Raquel C. de Melo Minardi

Summary: This work presents the development of a convolutional neural network model to identify large scale conformational changes in proteins, using images that illustrate the interatomic distance matrices. The model achieves satisfactory results in cross validation, successfully distinguishing open and closed states of the S protein from SARS-CoV-2. The model shows potential for identifying even more subtle conformational changes in the future.

ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2022 (2022)

Article Biochemical Research Methods

nAPOLI: A Graph-Based Strategy to Detect and Visualize Conserved Protein-Ligand Interactions in Large-Scale

Alexandre V. Fassio, Lucianna H. Santos, Sabrina A. Silveira, Rafaela S. Ferreira, Raquel C. de Melo-Minardi

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2020)

Article Computer Science, Cybernetics

Proteingo: Motivation, user experience, and learning of molecular interactions in biological complexes

Marcos F. M. Silva, Pedro M. Martins, Diego C. B. Mariano, Lucianna Helene Santos, Isabela Pastorini, Naiara Pantuza, Cristiane N. Nobre, Raquel C. de Melo-Minardi, Proteingo Players

ENTERTAINMENT COMPUTING (2019)

No Data Available