4.6 Article

The tangled bank of amino acids

Journal

PROTEIN SCIENCE
Volume 25, Issue 7, Pages 1354-1362

Publisher

WILEY
DOI: 10.1002/pro.2930

Keywords

protein evolution; molecular evolution; epistasis; epistatic interactions; substitution matrices; substitution rates; evolutionary process; phylogenetics; evolutionary Stokes shift

Funding

  1. National Institutes of Health [R01 GM083127]
  2. Medical Research Council (MRC) UK
  3. Medical Research Council [MC_U117573805, MC_PC_13056] Funding Source: researchfish
  4. MRC [MC_PC_13056, MC_U117573805] Funding Source: UKRI

Ask authors/readers for more resources

The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Editorial Material Biochemistry & Molecular Biology

Society for Molecular Biology and Evolution, Council and Business Meetings, 2019, Manchester, United Kingdom

David D. Pollock

MOLECULAR BIOLOGY AND EVOLUTION (2020)

Article Genetics & Heredity

Finding and extending ancient simple sequence repeat-derived regions in the human genome

Jonathan A. Shortt, Robert P. Ruggiero, Corey Cox, Aaron C. Wacholder, David D. Pollock

MOBILE DNA (2020)

Article Medicine, General & Internal

Dynamics of within-host Mycobacterium tuberculosis diversity and heteroresistance during treatment

Camus Nimmo, Kayleen Brien, James Millard, Alison D. Grant, Nesri Padayatchi, Alexander S. Pym, Max O'Donnell, Richard Goldstein, Judith Breuer, Francois Balloux

EBIOMEDICINE (2020)

Article Biochemistry & Molecular Biology

Viral CpG Deficiency Provides No Evidence That Dogs Were Intermediate Hosts for SARS-CoV-2

David D. Pollock, Todd A. Castoe, Blair W. Perry, Spyros Lytras, Kristen J. Wade, David L. Robertson, Edward C. Holmes, Maciej F. Boni, Sergei L. Kosakovsky Pond, Rhys Parry, Elizabeth J. Carlton, James L. N. Wood, Pleuni S. Pennings, Richard A. Goldstein

MOLECULAR BIOLOGY AND EVOLUTION (2020)

Article Multidisciplinary Sciences

Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution

Kenji Fukushima, David D. Pollock

NATURE COMMUNICATIONS (2020)

Article Virology

Preadaptation of pandemic GII.4 noroviruses in unsampled virus reservoirs years before emergence

Christopher Ruis, Lisa C. Lindesmith, Michael L. Mallory, Paul D. Brewer-Jensen, Josephine M. Bryant, Veronica Costantini, Christopher Monit, Jan Vinje, Ralph S. Baric, Richard A. Goldstein, Judith Breuer

VIRUS EVOLUTION (2020)

Article Infectious Diseases

Patterns of relatedness and genetic diversity inferred from whole genome sequencing of archival blood fluke miracidia (Schistosoma japonicum)

Zachary L. Nikolakis, Nicole R. Hales, Blair W. Perry, Drew R. Schield, Laura E. Timm, Yang Liu, Bo Zhong, Katerina J. Kechris, Elizabeth J. Carlton, David D. Pollock, Todd A. Castoe

Summary: Genomic approaches show potential in studying transmission patterns and responses to control efforts for diseases like schistosomiasis. This study demonstrates the use of whole genome amplification and sequencing to analyze individual archival schistosome larvae, providing insights into genetic diversity, relatedness, and transmission patterns. The results suggest that population-level WGS datasets for individual larvae could be valuable for disease control efforts, offering new avenues for understanding parasite biology and transmission pathways.

PLOS NEGLECTED TROPICAL DISEASES (2021)

Article Multidisciplinary Sciences

SARS-CoV-2 evolution during treatment of chronic infection

Steven A. Kemp, Dami A. Collier, Rawlings P. Datir, Isabella A. T. M. Ferreira, Salma Gayed, Aminu Jahun, Myra Hosmillo, Chloe Rees-Spear, Petra Mlcochova, Ines Ushiro Lumb, David J. Roberts, Anita Chandra, Nigel Temperton, Katherine Sharrocks, Elizabeth Blane, Yorgo Modis, Kendra E. Leigh, John A. G. Briggs, Marit J. van Gils, Kenneth G. C. Smith, John R. Bradley, Chris Smith, Rainer Doffinger, Lourdes Ceron-Gutierrez, Gabriela Barcenas-Morales, David D. Pollock, Richard A. Goldstein, Anna Smielewska, Jordan P. Skittrall, Theodore Gouliouris, Ian G. Goodfellow, Effrossyni Gkrania-Klotsas, Christopher J. R. Illingworth, Laura E. McCoy, Ravindra K. Gupta

Summary: Chronic infection with SARS-CoV-2 can lead to viral evolution and increased resistance to neutralizing antibodies in immunosuppressed individuals treated with convalescent plasma. During convalescent plasma therapy, there were significant shifts in the viral population structure and sensitivity, suggesting strong selection pressure on the virus during treatment.

NATURE (2021)

Article Multidisciplinary Sciences

Selection for cooperativity causes epistasis predominately between native contacts and enables epistasis-based structure reconstruction

R. Charlotte Eccleston, David D. Pollock, Richard A. Goldstein

Summary: Epistasis and cooperativity in protein folding are both influenced by networks of energetic interactions within proteins, and their selection can affect each other. Selection for cooperativity may be crucial for predicting protein structure using epistasis measurements.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Correction Multidisciplinary Sciences

Amalgamated cross-species transcriptomes reveal organ-specific propensity in gene expression evolution (vol 11, 4459, 2020)

Kenji Fukushima, David D. Pollock

NATURE COMMUNICATIONS (2021)

Editorial Material Parasitology

A genomic can of worms for schistosome host-specificity

Zachary L. Nikolakis, Elizabeth J. Carlton, David D. Pollock, Todd A. Castoe

Summary: Luo et al. identified genes associated with host-switching in Schistosoma japonicum, providing a guide for studying selected chromosomal genes using population genetics.

TRENDS IN PARASITOLOGY (2022)

Article Virology

Haplotype assignment of longitudinal viral deep sequencing data using covariation of variant frequencies

Cristina Venturini, Juanita Pang, Asif U. Tamuri, Sunando Roy, Claire Atkinson, Paul Griffiths, Judith Breuer, Richard A. Goldstein

Summary: This study presents a new approach called HaROLD for haplotype reconstruction of slowly evolving viruses based on identifying co-varying variant frequencies using a probabilistic framework. It demonstrates high accuracy in synthetic and clinical datasets.

VIRUS EVOLUTION (2022)

Article Ecology

Detecting macroevolutionary genotype-phenotype associations using error-corrected rates of protein convergence

Kenji Fukushima, David D. Pollock

Summary: To address the problem of convergent evolution on macroevolutionary timescales, we developed a new metric called omega(C) that measures error-corrected convergence rate of protein evolution. Using gene expression data, we explored over 20 million branch combinations in vertebrate genes and identified the joint convergence of expression patterns and protein sequences with amino acid substitutions in functionally important sites, providing hypotheses on undiscovered phenotypes.

NATURE ECOLOGY & EVOLUTION (2023)

Article Multidisciplinary Sciences

Genomic and geographical structure of human cytomegalovirus

Oscar J. Charles, Cristina Venturini, Soren Gantt, Claire Atkinson, Paul Griffiths, Richard A. Goldstein, Judith Breuer

Summary: Human cytomegalovirus (CMV), a highly variable virus, infects most of the world's population and shows both conserved and variable regions in its genome. The conserved regions and a minority of the variable regions display geographic population structure, while other variable regions are not geographically structured and are enriched for genes encoding immunomodulatory functions. This suggests the presence of at least two CMV founder populations and a second evolutionary process of balancing selection. Understanding these processes is crucial for vaccine development and drug target identification.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2023)

No Data Available