4.5 Article

APL: An angle probability list to improve knowledge-based metaheuristics for the three-dimensional protein structure prediction

期刊

COMPUTATIONAL BIOLOGY AND CHEMISTRY
卷 59, 期 -, 页码 142-157

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2015.08.006

关键词

Three-dimensional protein structure prediction; Amino acid conformational preferences; Metaheuristics

资金

  1. FAPERGS [002021-25.51/13]
  2. MCT/CNPq, Brazil [473692/2013-9]
  3. Fondecyt Iniciacion Conicyt-Chile [11121288]

向作者/读者索取更多资源

Tertiary protein structure prediction is one of the most challenging problems in structural bioinformatics. Despite the advances in algorithm development and computational strategies, predicting the folded structure of a protein only from its amino acid sequence remains as an unsolved problem. We present a new computational approach to predict the native-like three-dimensional structure of proteins. Conformational preferences of amino acid residues and secondary structure information were obtained from protein templates stored in the Protein Data Bank and represented as an Angle Probability List. Two knowledge-based prediction methods based on Genetic Algorithms and Particle Swarm Optimization were developed using this information. The proposed method has been tested with twenty-six case studies selected to validate our approach with different classes of proteins and folding patterns. Stereochemical and structural analysis were performed for each predicted three-dimensional Structure. Results achieved suggest that the Angle Probability List can improve the effectiveness of metaheuristics used to predicted the three-dimensional structure of protein molecules by reducing its conformational search space. (C) 2015 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Relevance aggregation for neural networks interpretability and knowledge discovery on tabular data

Bruno Iochins Grisci, Mathias J. Krause, Marcio Dorn

Summary: The study introduces a relevance aggregation algorithm that combines the relevance computed from multiple samples by a neural network to generate scores for each input feature. Two visualization methods for learned patterns were presented to enhance model comprehension. The method accurately identifies the most important features for network predictions.

INFORMATION SCIENCES (2021)

Article Chemistry, Multidisciplinary

Modifying the catalytic preference of alpha-amylase toward n-alkanes for bioremediation purposes using in silico strategies

Ederson Sales Moreira Pinto, Bruno Cesar Feltes, Conrado Pedebos, Marcio Dorn

Summary: Bioremediation is highlighted as a cost-effective and eco-friendly alternative to reverse the damage caused by oil pollution, but more efforts are needed to design enzymes for the process.

JOURNAL OF COMPUTATIONAL CHEMISTRY (2021)

Article Biology

A multi-modal algorithm based on an NSGA-II scheme for phylogenetic tree inference

Manuel Villalobos-Cid, Cesar Rivera, Eduardo I. Kessi-Perez, Mario Inostroza-Ponta

Summary: Analyzing evolutionary data provides important information about the adaptation and evolution of organisms. Optimization methods, particularly multiobjective optimization, are effective approaches for addressing this problem. In this paper, a multi-modal metaheuristic approach is used to tackle the multiobjective phylogenetic inference problem. A novel metric based on topological tree distance is introduced. Comparative analysis with state of the art algorithms and a case study on yeast dataset demonstrate the improved diversity and quality of solutions achieved by the proposed method.

BIOSYSTEMS (2022)

Article Computer Science, Interdisciplinary Applications

Gene Expression Variation Analysis (GEVA): A new R package to evaluate variations in differential expression in multiple biological conditions

Itamar Jose Guimaraes Nunes, Bruno Cesar Feltes, Murilo Zanini David, Marcio Dorn

Summary: This study presents a new R package called GEVA, which can analyze multiple gene expression datasets and identify genes with consistent differential expression across experiments. GEVA takes multiple differential expression analysis results as input and performs weighted summarization, partitioning, and clustering to find genes that vary less across experiments. The package also allows grouping of experimental conditions into factors, and performs additional statistical tests to identify genes specifically or dependently expressed in response to these factors. Results from the package were validated using knockout studies and were found to be consistent with published experimental studies. GEVA provides a robust alternative for multiple comparison analyses.

JOURNAL OF BIOMEDICAL INFORMATICS (2022)

Article Genetics & Heredity

Transcriptomic Analysis of Long Non-Coding RNA during Candida albicans Infection

Gabriela Flores Goncalves, Joice de Faria Poloni, Marcio Dorn

Summary: This study investigated the association between lncRNAs and host response to C. albicans infection. By analyzing a public RNA-Seq dataset, the researchers identified lncRNAs related to biological processes involved in immune response and reactive species production. These findings provide new insights into the functions of lncRNAs in the immune response.
Review Genetics & Heredity

Interdisciplinary Overview of Lipopeptide and Protein-Containing Biosurfactants

Regis Antonioli Junior, Joice de Faria Poloni, Ederson Sales Moreira Pinto, Marcio Dorn

Summary: Biosurfactants are amphipathic molecules that can lower interfacial and superficial tensions. They are biodegradable and have a wide diversity of categories, with lipopeptides being the most abundant and well-known. Protein-containing biosurfactants are less studied but could be a valuable alternative. Understanding the harsh conditions that target organisms can sustain is crucial for successful implementation. This article explores the biotechnological applications of lipopeptide and protein-containing biosurfactants, as well as their natural role and the potential research possibilities using meta-omics and machine learning.
Article Computer Science, Artificial Intelligence

An evolutionary algorithm based on parsimony for the multiobjective phylogenetic network inference problem

Manuel Villalobos-Cid, Marcio Dorn, Angela Contreras, Mario Inostroza-Ponta

Summary: Phylogenetic networks are used to capture evolutionary phenomena that cannot be described by phylogenetic trees, and different inference criteria may result in conflicting network topologies. Multi-objective optimization can handle conflicting objectives and reduce dependency on specific criteria. In this study, a multi-objective evolutionary algorithm called MO-PhyNet is proposed to infer phylogenetic networks by minimizing three criteria: hardwired parsimony, softwired parsimony, and the number of reticulations. The formalization of phylogenetic inference as a multi-objective optimization problem allows for the identification of different reticulated topologies representing distinct evolutionary hypotheses.

APPLIED SOFT COMPUTING (2023)

Article Computer Science, Software Engineering

Implicit propagation of directly addressed grids in lattice Boltzmann methods

Adrian Kummerlaender, Marcio Dorn, Martin Frank, Mathias J. Krause

Summary: This article revisits and extends the work on implicit propagation on directly addressed grids by considering them as transformations of the underlying space filling curve. A new periodic shift (PS) pattern is proposed that provides consistent performance across a range of targets. Benchmark results for PS and shift-swap-streaming (SSS) on SIMD CPUs and Nvidia GPUs are provided, and the application of PS as the propagation pattern of the open source LBM framework OpenLB is summarized.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2023)

Article Chemistry, Multidisciplinary

NIAS-Server 2.0: A versatile complementary tool for structural biology studies

Bruno Cesar Feltes, Ederson Sales Moreira Pinto, Arthur Tonietto Mangini, Marcio Dorn

Summary: The NIAS server is a tool that analyzes the conformational preferences of amino acids in proteins. It is based on the Angle Probability List and integrates structural information from the Protein Data Bank. The updated NIAS server includes data from various experimental techniques and provides examples of its application in structural biology research, as well as discussions on its limitations.

JOURNAL OF COMPUTATIONAL CHEMISTRY (2023)

Article Biochemistry & Molecular Biology

Functional response of microbial communities in lab-controlled oil-contaminated marine sediment

Regis Antonioli Junior, Joice de Faria Poloni, Manuel Riveros A. Escalona, Marcio Dorn

Summary: Crude oil contamination poses a significant threat to the environment and biodiversity. However, certain microorganisms in contaminated areas can utilize hydrocarbons as a source of nutrients, highlighting the importance of understanding local community dynamics in these environments. Through the analysis of genetic and functional data, we identified the prevalence of hydrocarbon-degrading functionality in contaminated sediments and potential targets for bioremediation.

MOLECULAR OMICS (2023)

Article Biochemistry & Molecular Biology

Meta-analyses of host metagenomes from colorectal cancer patients reveal strong relationship between colorectal cancer-associated species

Manuel Adrian Riveros Escalona, Joice de Faria Poloni, Mathias J. Krause, Marcio Dorn

Summary: Colorectal cancer is commonly associated with changes in the gut microbiota, and recent studies have provided a better understanding of the specific species related to its development. However, the importance of certain species and their interactions across different datasets still needs further exploration.

MOLECULAR OMICS (2023)

Review Computer Science, Artificial Intelligence

The use of gene expression datasets in feature selection research: 20 years of inherent bias?

Bruno I. Grisci, Bruno Cesar Feltes, Joice de Faria Poloni, Pedro H. Narloch, Marcio Dorn

Summary: A review of over 1200 publications on feature selection and gene expression between 2010 and 2020 found that 57% of the publications used outdated datasets, 23% used only outdated data, and 32% did not cite data sources. Problems such as referencing unavailable databases, slow adoption of RNA-seq datasets, and bias towards human cancer data were also identified. These issues can result in inaccurate and misleading biological results.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2023)

Proceedings Paper Computer Science, Artificial Intelligence

A Multi-population Schema Designed for Biased Random-Key Genetic Algorithms on Continuous Optimisation Problems

Mateus Boiani, Rafael Stubs Parpinelli, Marcio Dorn

Summary: Population diversity plays a crucial role in determining the quality of solutions in Evolutionary Algorithms. This paper proposes a new migration policy for the BRKGA and compares its performance with traditional strategies in continuous search spaces. The results show that the proposal can improve the optimization capability of the BRKGA.

INTELLIGENT SYSTEMS, PT I (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Multi-objective prioritization for data center vulnerability remediation

Felipe Colombelli, Vitor Kehl Matter, Bruno Iochins Grisci, Leomar Lima, Karine Heinen, Marcio Borges, Sandro Jose Rigo, Jorge Luis Victoria Barbosa, Rodrigo Da Rosa Righi, Cristiano Andre da Costa, Gabriel De Oliveira Ramos

Summary: Securing information in data centers is a major challenge nowadays. Prioritizing vulnerabilities is crucial to prevent leaks and other security issues. However, there is limited research on intelligent methods for vulnerability prioritization. This study proposes a multi-objective method that ranks vulnerabilities based on user-chosen assessment metrics. Experimental results show that the proposed method can significantly reduce the number of vulnerabilities needed to achieve an organization's security targets compared to baseline analysis by a security team.

2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) (2022)

Article Biology

Netting into the Sophoretin pool: An approach to trace GSTP1 inhibitors for reversing chemoresistance

Kunal Bhattacharya, Shikha Mahato, Satyendra Deka, Nongmaithem Randhoni Chanu, Amit Kumar Shrivastava, Pukar Khanal

Summary: Chemoresistance, a major challenge in cancer treatment, is associated with the cellular glutathione-related detoxification system. A study has identified GSTP1 enzyme as critical in the inactivation of anticancer drugs and suggests the need for GSTP1 inhibitors to combat chemoresistance. Through molecular docking and simulations, the study found that quercetin 7-O-beta-D-glucoside showed promise as a potential candidate for addressing chemoresistance in cancer patients.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Structure and energetics of serum protein complex of tea adulterant dye Bismarck brown Y using experimental and computational methods

Manwi Shankar, Majji Sai Sudha Rani, Priyanka Gopi, P. Arsha, Prateek Pandya

Summary: This study investigates the interaction between the food dye BBY and the serum protein BSA. The results show that BBY binds to a specific site on BSA through hydrophobic interactions, affecting the structural stability of the protein. These findings enhance our understanding of the molecular-level interactions between BBY and BSA.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Implementing link prediction in protein networks via feature fusion models based on graph neural networks

Chi Zhang, Qian Gao, Ming Li, Tianfei Yu

Summary: In this study, we propose a graph neural network-based autoencoder model, AGraphSAGE, that effectively predicts protein-protein interactions across diverse biological species by integrating gene ontology.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Named entity recognition of rice genes and phenotypes based on BiGRU neural networks

Kangjie Wu, Liqian Xu, Xinxiang Li, Youhua Zhang, Zhenyu Yue, Yujia Gao, Yiqiong Chen

Summary: Named Entity Recognition (NER) is a crucial task in natural language processing (NLP) and big data analysis, with wide application range. This paper proposes an improved neural network method for NER of rice genes and phenotypes, which can learn semantic information in the context without feature engineering. Experimental results show that the proposed model outperforms other models.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

Revisiting structural organization of proteins at high temperature from a network perspective

Suman Hait, Sudip Kundu

Summary: Interactions between amino acids in proteins are crucial for stability and structural integrity. Thermophiles have more and more stable interactions to survive in extreme environments. Different types of interactions are enriched in different structural regions.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

XL1R-Net: Explainable AI-driven improved LI-regularized deep neural architecture for NSCLC biomarker identification

Kountay Dwivedi, Ankit Rajpal, Sheetal Rajpal, Virendra Kumar, Manoj Agarwal, Naveen Kumar

Summary: This study aims to identify biomarkers for non-small cell lung cancer (NSCLC) using copy number variation (CNV) data. A novel deep learning architecture, XL1R-Net, is proposed to improve the classification accuracy for NSCLC subtyping. Twenty NSCLC-relevant biomarkers are uncovered using explainable AI (XAI)-based feature identification. The results show that the identified biomarkers have high classification performance and clinical relevance. Additionally, twelve of the biomarkers are potentially druggable and eighteen of them have a high probability of predicting NSCLC patients' survival likelihood according to the Drug-Gene Interaction Database and the K-M Plotter tool, respectively. This research suggests that investigating these seven novel biomarkers can contribute to NSCLC therapy, and the integration of multiomics data and other sources will help better understand NSCLC heterogeneity.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)

Article Biology

AMPCDA: Prediction of circRNA-disease associations by utilizing attention mechanisms on metapaths

Pengli Lu, Wenqi Zhang, Jinkai Wu

Summary: Researchers have developed a computational method, AMPCDA, to predict circRNA-disease associations using predefined metapaths, achieving high predictive accuracy. This method effectively combines node embeddings with higher-order neighborhood representations and provides valuable guidance for revealing new disease mechanisms in biological research.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2024)