4.7 Article

T-REKS: identification of Tandem REpeats in sequences with a K-meanS based algorithm

期刊

BIOINFORMATICS
卷 25, 期 20, 页码 2632-2638

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btp482

关键词

-

资金

  1. Ministere de l'Education Nationale, de la Recherche et de la Technologie (MENRT)

向作者/读者索取更多资源

Motivation: Over the last years a number of evidences have been accumulated about high incidence of tandem repeats in proteins carrying fundamental biological functions and being related to a number of human diseases. At the same time, frequently, protein repeats are strongly degenerated during evolution and, therefore, cannot be easily identified. To solve this problem, several computer programs which were based on different algorithms have been developed. Nevertheless, our tests showed that there is still room for improvement of methods for accurate and rapid detection of tandem repeats in proteins. Results: We developed a new program called T-REKS for ab initio identification of the tandem repeats. It is based on clustering of lengths between identical short strings by using a K-means algorithm. Benchmark of the existing programs and T-REKS on several sequence datasets is presented. Our program being linked to the Protein Repeat DataBase opens the way for large-scale analysis of protein tandem repeats. T-REKS can also be applied to the nucleotide sequences.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Disentangling the complexity of low complexity proteins

Pablo Mier, Lisanna Paladin, Stella Tamana, Sophia Petrosian, Borbala Hajdu-Soltesz, Annika Urbanek, Aleksandra Gruca, Dariusz Plewczynski, Marcin Grynberg, Pau Bernado, Zoltan Gaspari, Christos A. Ouzounis, Vasilis J. Promponas, Andrey V. Kajava, John M. Hancock, Silvio C. E. Tosatto, Zsuzsanna Dosztanyi, Miguel A. Andrade-Navarro

BRIEFINGS IN BIOINFORMATICS (2020)

Article Biochemistry & Molecular Biology

DisProt: intrinsic protein disorder annotation in 2020

Andras Hatos, Borbala Hajdu-Soltesz, Alexander M. Monzon, Nicolas Palopoli, Lucia Alvarez, Burcu Aykac-Fas, Claudio Bassot, Guillermo Benitez, Martina Bevilacqua, Anastasia Chasapi, Lucia Chemes, Norman E. Davey, Radoslav Davidovic, A. Keith Dunker, Arne Elofsson, Julien Gobeill, Nicolas S. Gonzalez Foutel, Govindarajan Sudha, Mainak Guharoy, Tamas Horvath, Valentin Iglesias, Andrey Kajava, Orsolya P. Kovacs, John Lamb, Matteo Lambrughi, Tamas Lazar, Jeremy Y. Leclercq, Emanuela Leonardi, Sandra Macedo-Ribeiro, Mauricio Macossay-Castillo, Emiliano Maiani, Jose A. Manso, Cristina Marino-Buslje, Elizabeth Martinez-Perez, Balint Meszaros, Ivan Micetic, Giovanni Minervini, Nikoletta Murvai, Marco Necci, Christos A. Ouzounis, Matyas Pajkos, Lisanna Paladin, Rita Pancsa, Elena Papaleo, Gustavo Parisi, Emilie Pasche, Pedro J. Barbosa Pereira, Vasilis J. Promponas, Jordi Pujols, Federica Quaglia, Patrick Ruch, Marco Salvatore, Eva Schad, Beata Szabo, Tamas Szaniszlo, Stella Tamana, Agnes Tantos, Nevena Veljkovic, Salvador Ventura, Wim Vranken, Zsuzsanna Dosztanyi, Peter Tompa, Silvio C. E. Tosatto, Damiano Piovesan

NUCLEIC ACIDS RESEARCH (2020)

Article Genetics & Heredity

Opposite Modulation of RAC1 by Mutations in TRIO Is Associated with Distinct, Domain-Specific Neurodevelopmental Disorders

Sonia Barbosa, Stephanie Greville-Heygate, Maxime Bonnet, Annie Godwin, Christine Fagotto-Kaufmann, Andrey Kajava, Damien Laouteouet, Rebecca Mawby, Htoo Aung Wai, Alexander J. M. Dingemans, Jayne Hehir-Kwa, Marjorlaine Willems, Yline Capri, Sarju G. Mehta, Helen Cox, David Goudie, Fleur Vansenne, Peter Turnpenny, Marie Vincent, Benjamin Cogne, Gaetan Lesca, Jozef Hertecant, Diana Rodriguez, Boris Keren, Lydie Burglen, Marion Gerard, Audrey Putoux, Vincent Cantagrel, Karine Siquier-Pernet, Marlene Rio, Siddharth Banka, Ajoy Sarkar, Marcie Steeves, Michael Parker, Emma Clement, Sebastien Moutton, Frederic Tran Mau-Them, Amelie Piton, Bert B. A. de Vries, Matthew Guille, Anne Debant, Susanne Schmidt, Diana Baralle

AMERICAN JOURNAL OF HUMAN GENETICS (2020)

Article Chemistry, Physical

Point mutations affecting yeast prion propagation change the structure of its amyloid fibrils

Anna Sulatskaya, Stanislav A. Bondarev, Maksim Sulatsky, Nina P. Trubitsina, Mikhail Belousov, Galina A. Zhouravleva, Manuel A. Llanos, Andrey Kajava, Irina M. Kuznetsova, Konstantin K. Turoverov

JOURNAL OF MOLECULAR LIQUIDS (2020)

Article Biochemistry & Molecular Biology

Amyloidogenicity as a driving force for the formation of functional oligomers

Rafayel A. Azizyan, Weiqiang Wang, Alexey Anikeenko, Zinaida Radkova, Anastasia Bakulina, Adriana Garro, Landry Charlier, Christian Dumas, Salvador Ventura, Andrey Kajava

JOURNAL OF STRUCTURAL BIOLOGY (2020)

Article Medicine, General & Internal

Modeling polymorphic ventricular tachycardia at rest using patient-specific induced pluripotent stem cell-derived cardiomyocytes

Yvonne Sleiman, Monia Souidi, Ritu Kumar, Ellen Yang, Fabrice Jaffre, Ting Zhou, Albin Bernardin, Steve Reiken, Olivier Cazorla, Andrey V. Kajava, Adrien Morea, Jean-Luc Pasquie, Andrew R. Marks, Bruce B. Lerman, Shuibing Chen, Jim W. Cheung, Todd Evans, Alain Lacampagne, Albano C. Meli

EBIOMEDICINE (2020)

Article Biochemistry & Molecular Biology

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

Lisanna Paladin, Martina Bevilacqua, Sara Errigo, Damiano Piovesan, Ivan Micetic, Marco Necci, Alexander Miguel Monzon, Maria Laura Fabre, Jose Luis Lopez, Juliet F. Nilsson, Javier Rios, Pablo Lorenzano Menna, Maia Cabrera, Martin Gonzalez Buitron, Mariane Goncalves Kulik, Sebastian Fernandez-Alberti, Maria Silvina Fornasari, Gustavo Parisi, Antonio Lagares, Layla Hirsh, Miguel A. Andrade-Navarro, Andrey Kajava, Silvio C. E. Tosatto

Summary: RepeatsDB database provides annotations and classification for protein tandem repeat structures from PDB. The new version 3.0 addresses challenges of data growth and annotation needs by introducing a hierarchical classification scheme.

NUCLEIC ACIDS RESEARCH (2021)

Article Engineering, Chemical

Trimeric SARS-CoV-2 Spike Proteins Produced from CHO Cells in Bioreactors Are High-Quality Antigens

Paco Pino, Joeri Kint, Divor Kiseljak, Valentina Agnolon, Giampietro Corradin, Andrey V. Kajava, Paolo Rovero, Ronald Dijkman, Gerco den Hartog, Jason S. McLellan, Patrick O. Byrne, Maria J. Wurm, Florian M. Wurm

PROCESSES (2020)

Article Biochemical Research Methods

Critical assessment of protein intrinsic disorder prediction

Marco Necci, Damiano Piovesan, Silvio C. E. Tosatto

Summary: Intrinsically disordered proteins present a challenge to traditional protein structure-function analysis, with computational methods, particularly deep learning techniques, showing superior performance in predicting disorder. However, predicting disordered binding regions remains difficult, and there is a significant variation in computational times among methods.

NATURE METHODS (2021)

Article Multidisciplinary Sciences

Δ133p53β isoform pro-invasive activity is regulated through an aggregation-dependent mechanism in cancer cells

Nikola Arsic, Tania Slatter, Gilles Gadea, Etienne Villain, Aurelie Fournet, Marina Kazantseva, Frederic Allemand, Nathalie Sibille, Martial Seveno, Sylvain de Rossi, Sunali Mehta, Serge Urbach, Jean-Christophe Bourdon, Pau Bernado, Andrey Kajava, Antony Braithwaite, Pierre Roux

Summary: The p53 isoform Delta 133p53 beta promotes intrinsic oncogenic functions, with its activity regulated through an aggregation-dependent mechanism. Interaction with partners like p63 family members or the CCT chaperone complex influences cancer cell features such as migration and invasion by modulating Delta 133p53 beta activity.

NATURE COMMUNICATIONS (2021)

Article Immunology

Immunoreactivity of Sera From Low to Moderate Malaria-Endemic Areas Against Plasmodium vivax rPvs48/45 Proteins Produced in Escherichia coli and Chinese Hamster Ovary Systems

Myriam Arevalo-Herrera, Kazutoyo Miura, Nora Cespedes, Carlos Echeverry, Eduardo Solano, Angelica Castellanos, Juan Sebastian Ramirez, Adolfo Miranda, Andrey V. Kajava, Carole Long, Giampietro Corradin, Socrates Herrera

Summary: The P48/45 antigen, a crucial factor in Plasmodium parasite fertilization, was found to be more immunoreactive when expressed in Chinese Hamster Ovary (CHO) cells compared to Escherichia coli, showing potential for use in a protein vaccine. While there was an age-dependent increase in response to both antigens, specific IgG antibodies to CHO-rPvs48/45 demonstrated functional activity in inhibiting parasite transmission, suggesting promising prospects for further research.

FRONTIERS IN IMMUNOLOGY (2021)

Article Biochemistry & Molecular Biology

Identification of a Region in the Common Amino-terminal Domain of Hendra Virus P, V, and W Proteins Responsible for Phase Transition and Amyloid Formation

Edoardo Salladini, Frank Gondelaud, Juliet F. Nilsson, Giulia Pesce, Christophe Bignon, Maria Grazia Murrali, Roxane Fabre, Roberta Pierattelli, Andrey Kajava, Branka Horvat, Denis Gerlier, Cyrille Mathieu, Sonia Longhi

Summary: Henipaviruses are zoonotic pathogens responsible for severe encephalitis in humans. Their V protein plays a key role in immune evasion and has been shown to undergo a liquid-to-hydrogel phase transition. A specific region within the Hendra virus V protein, referred to as PNT3, forms amyloid-like fibrils, highlighting the potential importance of phase separation and fibrillation in the functional role of Henipavirus V proteins.

BIOMOLECULES (2021)

Article Biochemistry & Molecular Biology

The Difference in Structural States between Canonical Proteins and Their Isoforms Established by Proteome-Wide Bioinformatics Analysis

Zarifa Osmanli, Theo Falgarone, Turkan Samadova, Gudrun Aldrian, Jeremy Leclercq, Ilham Shahmuradov, Andrey Kajava

Summary: Alternative splicing is an important mechanism for generating protein diversity in cells. However, there is still a lack of structural data on alternative protein isoforms, as experimental studies typically focus on canonical proteins. In recent years, advances in bioinformatics tools and the development of the AlphaFold program have allowed for the modeling of high-confidence structures of isoforms. In this study, in silico analysis of 58 eukaryotic proteomes was performed, revealing differences in signal peptides, transmembrane regions, and tandem repeat regions between isoforms and canonical counterparts, potentially impacting protein function and cellular localization.

BIOMOLECULES (2022)

Article Biochemistry & Molecular Biology

Molecular Determinants of Fibrillation in a Viral Amyloidogenic Domain from Combined Biochemical and Biophysical Studies

Juliet F. F. Nilsson, Hakima Baroudi, Frank Gondelaud, Giulia Pesce, Christophe Bignon, Denis Ptchelkine, Joseph Chamieh, Herve Cottet, Andrey V. V. Kajava, Sonia Longhi

Summary: The Nipah and Hendra viruses, classified as Henipaviruses, are highly dangerous human pathogens that counteract the host immune response. A recent study found that a short region within the shared N-terminal domain (PNT3) of the Phosphoprotein (P protein) can form amyloid-like structures. This study evaluated the role of specific tyrosine residues within this region in fibrillation. Results showed that removal of a single tyrosine significantly decreases fibril formation, mainly affecting the elongation phase, and the C-terminal half of PNT3 inhibits fibril formation. The study sheds light on the molecular mechanisms of fibril formation.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Article Biochemistry & Molecular Biology

Estimation of amyloid aggregate sizes with semi-denaturing detergent agarose gel electrophoresis and its limitations

Polina B. Drozdova, Yury A. Barbitoff, Mikhail V. Belousov, Rostislav K. Skitchenko, Tatyana M. Rogoza, Jeremy Y. Leclercq, Andrey V. Kajava, Andrew G. Matveenko, Galina A. Zhouravleva, Stanislav A. Bondarev

暂无数据