4.6 Review

Bringing Molecules Back into Molecular Evolution

期刊

PLOS COMPUTATIONAL BIOLOGY
卷 8, 期 6, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1002572

关键词

-

资金

  1. NIH grant [R01 GM088344]
  2. NSF Cooperative Agreement [DBI-0939454]

向作者/读者索取更多资源

Much molecular-evolution research is concerned with sequence analysis. Yet these sequences represent real, three-dimensional molecules with complex structure and function. Here I highlight a growing trend in the field to incorporate molecular structure and function into computational molecular-evolution work. I consider three focus areas: reconstruction and analysis of past evolutionary events, such as phylogenetic inference or methods to infer selection pressures; development of toy models and simulations to identify fundamental principles of molecular evolution; and atom-level, highly realistic computational modeling of molecular structure and function aimed at making predictions about possible future evolutionary events.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemistry & Molecular Biology

Moderate Amounts of Epistasis are Not Evolutionarily Stable in Small Populations

Dariya K. Sydykova, Thomas LaBar, Christoph Adami, Claus O. Wilke

JOURNAL OF MOLECULAR EVOLUTION (2020)

Article Microbiology

A Comprehensive Coexpression Network Analysis in Vibrio cholerae

Cory D. DuPai, Claus O. Wilke, Bryan W. Davies

MSYSTEMS (2020)

Article Biochemistry & Molecular Biology

Site-Specific Amino Acid Distributions Follow a Universal Shape

Mackenzie M. Johnson, Claus O. Wilke

JOURNAL OF MOLECULAR EVOLUTION (2020)

Article Biochemistry & Molecular Biology

A systematic analysis of the beta hairpin motif in the Protein Data Bank

Cory D. DuPai, Bryan W. Davies, Claus O. Wilke

Summary: The beta hairpin motif is a widely distributed protein structural motif known for its stability and adaptability to various functions. Research revealed that specific amino acid preferences within the structure, particularly in the turn region, depend on subtle dynamics associated with turn length and secondary structure. Broad design principles, such as including specific aspartic acid residues and considering desired secondary structure in selecting residues for the turn region, were established for generating libraries encoding proteins or peptides containing beta hairpin structures.

PROTEIN SCIENCE (2021)

Article Microbiology

TsrA Regulates Virulence and Intestinal Colonization in Vibrio cholerae

Cory D. Dupai, Ashley L. Cunningham, Aaron R. Conrado, Claus O. Wilke, Bryan W. Davies

MSPHERE (2020)

Article Multidisciplinary Sciences

Modeling poliovirus replication dynamics from live time-lapse single-cell imaging data

Ashley Teufel, Wu Liu, Jeremy A. Draghi, Craig E. Cameron, Claus O. Wilke

Summary: Viruses experience selective pressure on the timing and order of events during infection. A method has been developed to examine the dynamic phenotypic landscape that viruses must face during replication. Parameters associated with translation and early stage viral replication processes are essential for capturing experimentally observed dynamics.

SCIENTIFIC REPORTS (2021)

Article Multidisciplinary Sciences

BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains

Adam J. Hockenberry, Claus O. Wilke

Summary: BACPHLIP is a tool that accurately predicts the lifestyle of bacteriophages by detecting conserved protein domains and using a trained classifier. It achieved an accuracy of 98% on a test set, greatly exceeding previous tools.
Article Biophysics

Learning the local landscape of protein structures with convolutional neural networks

Anastasiya Kulikova, Daniel J. Diaz, James M. Loy, Andrew D. Ellington, Claus O. Wilke

Summary: The study investigates the use of 3D convolutional neural networks for predicting protein structures from amino acid sequences. The network can accurately predict wild type with good confidence levels, but has less accuracy in predicting consensus, primarily driven by whether or not the consensus matches the wild type. High-confidence mis-predictions of the wild type may indicate sites that are primed for mutation and could be targets for protein engineering.

JOURNAL OF BIOLOGICAL PHYSICS (2021)

Article Multidisciplinary Sciences

Generating dynamic gene expression patterns without the need for regulatory circuits

Sahil B. Shah, Alexis M. Hill, Claus O. Wilke, Adam J. Hockenberry

Summary: This study demonstrates that by varying the placement and strengths of promoters, terminators, and RNase cleavage sites, it is possible to achieve various basic gene expression patterns, without introducing auxiliary gene products.

PLOS ONE (2022)

Article Biotechnology & Applied Microbiology

Evidence for Widespread Class II Microcins in Enterobacterales Genomes

T. Jeffrey Cole, Jennifer K. Parker, Aaron L. Feller, Claus O. Wilke, Bryan W. Davies

Summary: Class II microcins are small bacteriocins produced by Gram-negative bacteria in the Enterobacteriaceae, playing a role in interbacterial competition. Despite limited direct evidence, they show a diverse array of molecular functionalities and target specificities, suggesting untapped potential for discovering peptide antibiotics.

APPLIED AND ENVIRONMENTAL MICROBIOLOGY (2022)

Article Biochemistry & Molecular Biology

Using machine learning to predict the effects and consequences of mutations in proteins

Daniel J. Diaz, Anastasiya V. Kulikova, Andrew D. Ellington, Claus O. Wilke

Summary: Machine and deep learning approaches utilize protein datasets to predict variants with improved fitness. Data availability and quality are more important constraints than the specifics of the algorithms. Even with limited experimental data, pre-training with generic protein datasets can still perform well with subsequent refinement via hybrid or transfer learning. Machine learning approaches will likely have a major role in future breakthroughs in protein biochemistry and engineering.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2023)

Article Multidisciplinary Sciences

Designing and identifying β-hairpin peptide macrocycles with antibiotic potential

Justin R. Randall, Cory D. DuPai, T. Jeffrey Cole, Gillian Davidson, Kyra E. Groover, Sabrina L. Slater, Despoina A. I. Mavridou, Claus O. Wilke, Bryan W. Davies

Summary: This article introduces a method for designing a large peptide library with macrocyclic beta-hairpin structure. Through activity-driven high-throughput screening, dozens of peptides that kill bacteria through selective membrane disruption are identified, and their biochemical features are analyzed using machine learning. Active peptides have a unique constrained structure and are highly enriched in cationic charge with arginine in their turn region. This study provides a synthetic strategy for structured macrocyclic peptide design and discovery, while also elucidating important characteristics for beta-hairpin antimicrobial peptide activity.

SCIENCE ADVANCES (2023)

Article Biochemistry & Molecular Biology

Growth-dependent Gene Expression Variation Influences the Strength of Codon Usage Biases

Mackenzie M. Johnson, Adam J. Hockenberry, Matthew J. Mcguffie, Luiz Carlos Vieira, Claus O. Wilke

Summary: Variation in gene expression is closely associated with codon usage bias, especially under rapid growth conditions. Growth rate-dependent expression variation is an important constraint that influences the evolution of gene sequences in microorganisms.

MOLECULAR BIOLOGY AND EVOLUTION (2023)

Article Multidisciplinary Sciences

Two sequence- and two structure-based ML models have learned different aspects of protein biochemistry

Anastasiya V. Kulikova, Daniel J. Diaz, Tianlong Chen, T. Jeffrey Cole, Andrew D. Ellington, Claus O. Wilke

Summary: This study compares two different types of deep learning models and finds that they have distinct strengths and weaknesses. The overall prediction accuracies between sequence-based and structure-based models are largely uncorrelated. The two structure-based models perform better in predicting buried residues, while the two LLMs perform better in predicting solvent-exposed amino acids. Finally, combining the individual model predictions can significantly improve the overall prediction accuracy.

SCIENTIFIC REPORTS (2023)

Article Multidisciplinary Sciences

Metagenomic discovery of CRISPR-associated transposons

James R. Rybarski, Kuang Hu, Alexis M. Hill, Claus O. Wilke, Ilya J. Finkelstein

Summary: A bioinformatic search has led to the discovery of new CRISPR-associated Tn7 transposons (CASTs), revealing different structures and evolution processes of the CAST systems.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

暂无数据