4.7 Review

On the Potential of Machine Learning to Examine the Relationship Between Sequence, Structure, Dynamics and Function of Intrinsically Disordered Proteins

Journal

JOURNAL OF MOLECULAR BIOLOGY
Volume 433, Issue 20, Pages -

Publisher

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jmb.2021.167196

Keywords

machine learning; intrinsically disordered protein; molecular complex; condensate; SLiM

Funding

  1. Novo Nordisk Challenge Programme REPIN [NNF18OC0033926]
  2. Lundbeck Foundation BRAINSTRUC initiative in structural biology [R155-2015-2666]
  3. Novo Nordisk Challenge Programme PRISM [NNF18OC0033950]

Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) play important roles in a wide range of biological functions and reveal novel mechanisms of interactions. Computational methods, including machine learning, are crucial for predicting the structures and functions of IDPs and IDRs. Experiments provide insights into complexes and may enable more accurate predictions.
Intrinsically disordered proteins (IDPs) constitute a broad set of proteins with few uniting and many diverging properties. IDPs, and intrinsically disordered regions (IDRs) interspersed between folded domains, are generally characterized as having no persistent tertiary structure; instead they interconvert between a large number of different and often expanded structures. IDPs and IDRs are involved in an enormously wide range of biological functions and reveal novel mechanisms of interactions, and while they defy the common structure-function paradigm of folded proteins, their structural preferences and dynamics are important for their function. We here discuss open questions in the field of IDPs and IDRs, focusing on areas where machine learning and other computational methods play a role. We discuss computational methods aimed to predict transiently formed local and long-range structure, including methods for integrative structural biology. We discuss the many different ways in which IDPs and IDRs can bind to other molecules, both via short linear motifs, as well as in the formation of larger dynamic complexes such as biomolecular condensates. We discuss how experiments are providing insight into such complexes and may enable more accurate predictions. Finally, we discuss the role of IDPs in disease and how new methods are needed to interpret the mechanistic effects of genomic variants in IDPs. © 2021 The Author(s). Published by Elsevier Ltd.

Recommended

Article Biophysics

Assessment of models for calculating the hydrodynamic radius of intrinsically disordered proteins

Francesco Pesce, Estella A. Newcombe, Pernille Seiffert, Emil E. Tranchant, Johan G. Olsen, Christy R. Grace, Birthe B. Kragelund, Kresten Lindorff-Larsen

Summary: Diffusion measurements by pulsed-field gradient NMR and fluorescence correlation spectroscopy can be used to probe the hydrodynamic radius of proteins. To address uncertainty about how accurately the hydrodynamic radius can be computed from atomic coordinates, conformational ensembles of intrinsically disordered proteins were built and compared with measurements of compaction. The Kirkwood-Riseman equation was found to provide the best description of the hydrodynamic radius probed by pulsed-field gradient NMR experiments.

BIOPHYSICAL JOURNAL (2023)

Review Oncology

Lynch syndrome, molecular mechanisms and variant classification

Amanda B. Abildgaard, Sofie Nielsen, Inge Bernstein, Amelie Stein, Kresten Lindorff-Larsen, Rasmus Hartmann-Petersen

Summary: Accurate diagnosis and clinical interpretation of individual variants are crucial for the treatment of Lynch syndrome, a heritable cancer disease. Traditional protein variant classification methods are complex, but recent developments in high-throughput technologies and computational prediction tools offer new possibilities for assessing variants of unknown significance and gaining mechanistic insights into the disease.

BRITISH JOURNAL OF CANCER (2023)

Article Biochemistry & Molecular Biology

HSP70-binding motifs function as protein quality control degrons

Amanda B. Abildgaard, Vasileios Voutsinos, Soren D. Petersen, Fia B. Larsen, Caroline Kampmeyer, Kristoffer E. Johansson, Amelie Stein, Tommer Ravid, Claes Andreasson, Michael K. Jensen, Kresten Lindorff-Larsen, Rasmus Hartmann-Petersen

Summary: Protein quality control (PQC) degrons are short protein segments that target misfolded proteins for proteasomal degradation, and chaperone-binding regions may function as PQC degrons. A canonical Hsp70-binding motif, the APPY peptide, functions as a dose-dependent PQC degron in yeast and human cells. The number of exposed Hsp70-binding sites in the yeast proteome correlates with reduced protein abundance and half-life.

CELLULAR AND MOLECULAR LIFE SCIENCES (2023)

Article Biochemistry & Molecular Biology

Prediction of Quality-control Degradation Signals in Yeast Proteins

Kristoffer E. Johansson, Bayan Mashahreh, Rasmus Hartmann-Petersen, Tommer Ravid, Kresten Lindorff-Larsen

Summary: Effective proteome homeostasis is crucial for cell and organism survival. Cells have efficient quality control systems to monitor and remove misfolded proteins. The nature and sequence properties of quality-control degrons are still unknown.

JOURNAL OF MOLECULAR BIOLOGY (2023)

Article Biology

Evolutionary fine-tuning of residual helix structure in disordered proteins manifests in complex structure and lifetime

Steffie Elkjaer, Amanda D. Due, Lise F. Christensen, Frederik F. Theisen, Lasse Staby, Birthe B. Kragelund, Karen Skriver

Summary: Evolution-guided mutagenesis and biophysical analysis reveal that residual helical structure in the binding region of an intrinsically disordered protein regulates the lifetime of its complex by affecting its dissociation.

COMMUNICATIONS BIOLOGY (2023)

Article Biochemistry & Molecular Biology

Rare Catechol-O-methyltransferase Missense Variants Are Structurally Unstable Proteasome Targets

Fia B. Larsen, Matteo Cagiada, Jonas Dideriksen, Amelie Stein, Kresten Lindorff-Larsen, Rasmus Hartmann-Petersen

Summary: Catechol-O-methyltransferase (COMT) is an important enzyme involved in the metabolism of neurotransmitters and catecholamine drugs, and its variation can affect pharmacokinetics and drug availability.

BIOCHEMISTRY (2023)

Review Biotechnology & Applied Microbiology

Folding of heterologous proteins in bacterial cell factories: Cellular mechanisms and engineering strategies

Yixin Rong, Sheila Ingemann Jensen, Kresten Lindorff-Larsen, Alex Toftgaard Nielsen

Summary: The expression of correctly folded and functional heterologous proteins is crucial in biotechnological production processes. Bacterial platform organisms like E. coli are commonly used due to their proven suitability at an industrial scale, but can suffer from protein aggregation and low functional protein levels. This review explores cellular mechanisms influencing protein folding and expression across different organisms, and discusses experimental methods to improve protein folding, such as codon optimization and chaperone co-production.

BIOTECHNOLOGY ADVANCES (2023)

Article Biochemistry & Molecular Biology

Lysine deserts prevent adventitious ubiquitylation of ubiquitin-proteasome components

Caroline Kampmeyer, Martin Gronbaek-Thygesen, Nicole Oelerich, Michael H. Tatham, Matteo Cagiada, Kresten Lindorff-Larsen, Wouter Boomsma, Kay Hofmann, Rasmus Hartmann-Petersen

Summary: Lysine is a common amino acid in the human proteome, but there are proteins that lack lysine residues. These lysine deserts are common in intrinsically disordered proteins involved in the ubiquitin-proteasome system. Introducing lysine residues can increase ubiquitylation of these proteins, and their stability and function may be affected. This avoidance of lysine residues may be an evolutionary mechanism to prevent unnecessary ubiquitylation in proteins closely involved with the ubiquitylation machinery.

CELLULAR AND MOLECULAR LIFE SCIENCES (2023)

Article Biochemistry & Molecular Biology

Global Analysis of Multi-Mutants to Improve Protein Function

Kristoffer E. Johansson, Kresten Lindorff-Larsen, Jakob R. Winther

Summary: Identifying amino acid substitutions that improve both stability and function of a protein is a challenge in protein engineering. The Global Multi-Mutant Analysis (GMMA) method is used to identify beneficial substitutions across a large library of protein variants by analyzing multiply-substituted variants. Experimental results showed that the top-ranking substitutions progressively enhanced the function of GFP. Large libraries of multiply-substituted variants could provide valuable information for protein engineering.

JOURNAL OF MOLECULAR BIOLOGY (2023)

Letter Biochemistry & Molecular Biology

Comment on "Intrinsic protein disorder uncouples affinity from binding specificity"

Kaare Teilum, Johan G. Olsen, Birthe B. Kragelund

PROTEIN SCIENCE (2023)

Article Biochemistry & Molecular Biology

Slow conformational changes in the rigid and highly stable chymotrypsin inhibitor 2

Yulian Gavrilov, Andreas Prestel, Kresten Lindorff-Larsen, Kaare Teilum

Summary: Slow conformational changes are important for protein function, but their impact on the overall folding stability is not well understood. This study investigates the effects of L49I and I57V substitutions on the slow conformational dynamics of CI2. The results show that these substitutions have minimal impact on the structure of the excited state, but the stability of the excited state is influenced by the stability of the main state. The interactions between substituted residues and water molecules play a role in linking subtle structural changes to slow conformational changes in the protein.

PROTEIN SCIENCE (2023)

Article Biology

The prolactin receptor scaffolds Janus kinase 2 via co-structure formation with phosphoinositide-4,5-bisphosphate

Raul Araya-Secchi, Katrine Bugge, Pernille Seiffert, Amalie Petry, Gitte W. Haxholm, Kresten Lindorff-Larsen, Stine Falsig Pedersen, Lise Arleth, Birthe B. Kragelund

Summary: This study investigates the role of lipids in PRLR signaling and finds that co-structure formation locks the disordered domain of PRLR in an extended structure, enabling signal relay.

Article Biochemistry & Molecular Biology

Phosphorylation of Schizosaccharomyces pombe Dss1 mediates direct binding to the ubiquitin-ligase Dma1 in vitro

Nina L. Jacobsen, Magnus Bloch, Peter S. Millard, Sarah F. Ruidiaz, Jonas D. Elsborg, Wouter Boomsma, Ruth Hendus-Altenburger, Rasmus Hartmann-Petersen, Birthe B. Kragelund

Summary: This study found that Schizosaccharomyces pombe Dss1 is phosphorylated by casein kinase 2 at three threonine sites in its linker region. Phosphorylation does not affect the ubiquitin-binding ability of Dss1, but it slightly destabilizes the C-terminal alpha-helix and mediates direct binding to the forkhead-associated domain of the RING-FHA E3 ubiquitin ligase defective in mitosis 1 (Dma1). These phosphorylation sites are absent in human Dss1.

PROTEIN SCIENCE (2023)

Correction Multidisciplinary Sciences

Polyelectrolyte interactions enable rapid association and dissociation in high-affinity disordered protein complexes (vol 11, 5736, 2020)

Andrea Sottini, Alessandro Borgia, Madeleine B. Borgia, Katrine Bugge, Daniel Nettels, Aritra Chowdhury, Petur O. Heidarsson, Franziska Zosel, Robert B. Best, Birthe B. Kragelund, Benjamin Schuler

NATURE COMMUNICATIONS (2023)

Review Cell Biology

The molecular basis for cellular function of intrinsically disordered protein regions

Alex S. Holehouse, Birthe B. Kragelund

Summary: Intrinsically disordered protein regions, lacking a stable 3D structure, are structurally heterogeneous and widely present in all kingdoms of life. Despite their lack of a defined structure, these regions play essential roles in cellular processes and can be regulated by their structural and chemical context. Recent studies have advanced our understanding of the link between protein sequence and conformational behavior in disordered regions, but the connection between sequence and molecular function is still not well defined.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2023)

Article Biochemistry & Molecular Biology

Mycobacterium tuberculosis Ku Stimulates Multi-round DNA Unwinding by UvrD1 Monomers

Ankita Chadda, Alexander G. Kozlov, Binh Nguyen, Timothy M. Lohman, Eric A. Galburt

Summary: In this study, it was found that the DNA damage response in Mycobacterium tuberculosis differs from well-studied model bacteria. The DNA repair helicase UvrD1 in Mtb is activated through a redox-dependent process and is closely associated with the homo-dimeric Ku protein. Additionally, Ku protein is shown to stimulate the helicase activity of UvrD1.

JOURNAL OF MOLECULAR BIOLOGY (2024)