4.5 Review

The imprint of codons on protein structure

Journal

BIOTECHNOLOGY JOURNAL
Volume 6, Issue 6, Pages 641-649

Publisher

WILEY-V C H VERLAG GMBH
DOI: 10.1002/biot.201000329

Keywords

Codon usage; In vivo protein folding; Synonymous codons; Translation speed; tRNA

Funding

  1. Engineering and Physical Sciences Research Council
  2. Biotechnology and Biological Sciences Research Council

Ask authors/readers for more resources

The central dogma of biology outlines the unidirectional flow of interpretable data from genetic sequence to protein sequence. This has led to the idea that a protein's structure is dependent only on its amino acid sequence and not its genetic sequence. Recently, however, a more than transient link between the coding genetic sequence and the protein structure has become apparent. The two interact at the ribosome via the process of co-translational protein folding. Evidence for co-translational folding is growing rapidly, but the influence of codons on the protein structure attained is still highly contentious. It is theorised that the speed of codon translation modulates the time available for protein folding and hence the protein structure. Here, past and present research regarding synonymous codons and codon translation speed are reviewed within the context of protein structure attainment.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemical Research Methods

Co-evolutionary distance predictions contain flexibility information

Dominik Schwarz, Guy Georges, Sebastian Kelm, Jiye Shi, Anna Vangone, Charlotte M. Deane

Summary: Co-evolution analysis and machine-learning techniques have improved the accuracy of predicting residue-residue contacts and distances in protein structures. The shape of predicted distance distributions contains information on the rigidity or flexibility of a protein and its constituent residues.

BIOINFORMATICS (2022)

Article Biochemical Research Methods

DLAB: deep learning methods for structure-based virtual screening of antibodies

Constantin Schneider, Andrew Buchanan, Bruck Taddese, Charlotte M. Deane

Summary: DLAB can be used to improve antibody-antigen docking and structure-based virtual screening, enhancing pose ranking for antibody docking experiments and selection of accurate and correctly ranked antibody-antigen pairings.

BIOINFORMATICS (2022)

Article Chemistry, Medicinal

Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked Poses

Fergus Boyles, Charlotte M. Deane, Garrett M. Morris

Summary: Machine learning scoring functions for protein-ligand binding affinity perform better on crystal structures than on docked poses, but a hybrid scoring function combining structure-based and ligand-based features shows comparable performance on docked poses to purely structure-based scoring functions trained on crystal poses. However, the hybrid scoring function may not always generalize well to protein targets not represented in the training set, indicating the need for improved scoring functions and additional validation benchmarks.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Biology

Empirical Bayes functional models for hydrogen deuterium exchange mass spectrometry

Oliver M. Crook, Chun-wa Chung, Charlotte M. Deane

Summary: Hydrogen deuterium exchange mass spectrometry (HDX-MS) is a technique to explore differential protein structure. We developed a new statistical testing procedure that incorporates a hydrogen deuterium exchange model and an empirical Bayes method to improve statistical power and interpretation. The flexibility of our method allows it to be applied in more experimental scenarios, reducing the burden on experimentalists to conduct excessive replicates.

COMMUNICATIONS BIOLOGY (2022)

Article Economics

Moving preinduction cervical ripening to a lower acuity inpatient setting using the synthetic hygroscopic cervical dilator: a cost-consequence analysis for the United States

Sita J. Saunders, Jody L. Grisamore, Tess Wong, Rafael Torrejon Torres, Rhodri Saunders, Brett Einerson

Summary: Utilizing the synthetic hygroscopic cervical dilator (SHCD) to introduce a low-acuity care room (ripening room) for cervical ripening as a part of labor induction can reduce the burden of care provision without compromising safety.

JOURNAL OF MEDICAL ECONOMICS (2022)

Article Chemistry, Medicinal

Fragment Merging Using a Graph Database Samples Different Catalogue Space than Similarity Search

Stephanie Wills, Ruben Sanchez-Garcia, Tim Dudgeon, Stephen D. Roughley, Andy Merritt, Roderick E. Hubbard, James Davidson, Frank von Delft, Charlotte M. Deane

Summary: Fragment merging is a promising approach to advancing fragments directly to on-scale potency. Searching commercial catalogues allows for quick and cost-effective identification of these merges, circumventing the challenge of synthetic accessibility.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Biology

ImmuneBuilder: Deep-Learning models for predicting the structures of immune proteins

Brennan Abanades, Wing Ki Wong, Fergus Boyles, Guy Georges, Alexander Bujotzek, Charlotte M. Deane

Summary: Immune receptor proteins are important in the immune system and have potential as biotherapeutics. A deep learning model called ImmuneBuilder is presented, which accurately predicts the structure of antibodies, nanobodies, and T-cell receptors. ImmuneBuilder outperforms AlphaFold2 in terms of accuracy and speed.

COMMUNICATIONS BIOLOGY (2023)

Article Computer Science, Artificial Intelligence

Testing the limits of SMILES-based de novo molecular generation with curriculum and deep reinforcement learning

Maranga Mokaya, Fergus Imrie, Willem P. van Hoorn, Aleksandra Kalisz, Anthony R. Bradley, Charlotte M. Deane

Summary: Deep reinforcement learning methods are potentially powerful tools for de novo design in the field of drug discovery. However, when there are few or no examples of desired molecules in the training data, recurrent neural network-based methods may have limitations in generating diverse and controllable molecular sets. To address these issues, a new curriculum-learning-inspired recurrent iterative optimization procedure is proposed, which allows the optimization of generated molecules for both known and unknown molecular profiles and provides control over exploration and exploitation. This method has been shown to generate specific and diverse sets of molecules with significantly more scaffolds compared to standard methods, although the choice of simplified molecular-input line-entry system (SMILES) representation has been found to affect the success of molecular optimization.

NATURE MACHINE INTELLIGENCE (2023)

Article Biochemical Research Methods

A Functional Bayesian Model for Hydrogen-Deuterium Exchange Mass Spectrometry

Oliver M. Crook, Nathan Gittens, Chun-wa Chung, Charlotte M. Deane

Summary: Proteins undergo structural perturbations when binding to others or subjected to environmental changes. HDX-MS can explore conformational changes in proteins by examining differences in deuterium incorporation rate. We propose a flexible Bayesian framework that improves stability, allows uncertainty quantification, and calculates statistical quantities that are inaccessible to other methods. Our findings demonstrate that a Bayesian approach can identify important binding epitopes from HDX data with consistent results.

JOURNAL OF PROTEOME RESEARCH (2023)

Editorial Material Biotechnology & Applied Microbiology

Perfecting antibodies with language models

Carlos Outeiral, Charlotte M. Deane

Summary: This study presents a general-purpose protein language model that can rapidly improve antibody properties.

NATURE BIOTECHNOLOGY (2023)

Article Health Care Sciences & Services

Evaluating the Environmental Impact of Single-Use and Multi-Use Surgical Staplers with Staple Line Buttressing in Laparoscopic Bariatric Surgery

Markus Meissner, Juliane Hafermann, Ubong Silas, Rhodri Saunders

Summary: Operation rooms have a significant environmental impact. Single-use staplers contribute to waste generation, while multi-use staplers can greatly reduce the environmental impact of surgery. Using multi-use staplers with pre-attached buttressing instead of single-use staplers with separate buttressing greatly reduces waste, resource consumption, and greenhouse gas emissions.

RISK MANAGEMENT AND HEALTHCARE POLICY (2023)

Article Multidisciplinary Sciences

KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies

Tobias H. Olsen, Brennan Abanades, Iain H. Moal, Charlotte M. Deane

Summary: KA-Search is a tool for rapid searching of antibody variable domains by amino acid sequence identity. It can predict antibody properties and find the most similar sequences from billions of antibody sequences within minutes. Examples of insights obtained using KA-Search are provided.

SCIENTIFIC REPORTS (2023)

Article Chemistry, Multidisciplinary

CoPriNet: graph neural networks provide accurate and rapid compound price prediction for molecule prioritisation

Ruben Sanchez-Garcia, David Havasi, Gergely Takacs, Matthew C. Robinson, Alpha Lee, Frank von Delft, Charlotte M. Deane

Summary: Compound availability is crucial in the drug discovery process. Synthetic accessibility scores have been used as proxies for compound availability, but they are not effective in representing compound prices. In this paper, we propose a deep learning model called CoPriNet, which can accurately predict compound prices and has fast execution times, making it suitable for high-throughput experiments.

DIGITAL DISCOVERY (2023)

Article Biochemistry & Molecular Biology

The Patent and Literature Antibody Database (PLAbDab): an evolving reference set of functionally diverse, literature-annotated antibody sequences and structures

Brennan Abanades, Tobias H. Olsen, Matthew I. J. Raybould, Broncio Aguilar-Sanjuan, Wing Ki Wong, Guy Georges, Alexander Bujotzek, Charlotte M. Deane

Summary: The study introduces PLAbDab, a self-updating database of antibodies that contains over 150,000 paired antibody sequences and 3D structural models. It allows users to search by sequence, structure, or keyword, and provides functions such as annotating query antibodies, analyzing structural models, and facilitating the compilation of customized datasets.

NUCLEIC ACIDS RESEARCH (2023)

Article Biochemical Research Methods

Paragraph-antibody paratope prediction using graph neural networks with minimal feature vectors

Lewis Chinery, Newton Wahome, Iain Moal, Charlotte M. Deane

Summary: The development of new vaccines and antibody therapeutics is a time-consuming and costly process. In this study, a structure-based paratope prediction tool called Paragraph is introduced, which outperforms existing tools by using simpler feature vectors and no antigen information.

BIOINFORMATICS (2023)

No Data Available