4.7 Article

SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics

期刊

BIOINFORMATICS
卷 31, 期 15, 页码 2489-2496

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btv185

关键词

-

资金

  1. German Research Foundation (DFG) [BA 2168/3-3, BA 2168/4-3 SPP 1395 InKoMBio, MO 2402/1-1]
  2. German Federal Ministry of Education and Research (BMBF) [031 6165A]

向作者/读者索取更多资源

Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n(6)). Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity (>= quartic time). Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm 'sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)', which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff's original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Peakhood: individual site context extraction for CLIP-seq peak regions

Michael Uhl, Dominik Rabsch, Florian Eggenhofer, Rolf Backofen

Summary: This article introduces Peakhood, the first tool that utilizes CLIP-seq peak regions, CLIP-seq read information, and genomic annotations to determine the context for each peak region and determine the most probable splice variant, resulting in a comprehensive collection of transcript context binding sites.

BIOINFORMATICS (2022)

Article Biochemical Research Methods

Bi-alignments with affine gaps costs

Peter F. Stadler, Sebastian Will

Summary: This study introduces the concept of bi-alignments, which can be used to align sequence and structure similarity simultaneously. Using affine cost bi-alignments, efficient alignments of large proteins can be computed.

ALGORITHMS FOR MOLECULAR BIOLOGY (2022)

Article Chemistry, Multidisciplinary

Galaxy workflows for fragment-based virtual screening: a case study on the SARS-CoV-2 main protease

Simon Bray, Tim Dudgeon, Rachael Skyner, Rolf Backofen, Bjorn Gruning, Frank von Delft

Summary: Several workflows for protein-ligand docking and free energy calculation in Galaxy, a workflow management system, are presented. These workflows incorporate widely used open-source tools such as rDock and GROMACS, and can be executed either through Galaxy's graphical interface or the command line on public infrastructure. The utility of these workflows is demonstrated by conducting a high-throughput virtual screening of approximately 50000 compounds against the SARS-CoV-2 main protease, a system of extensive study in the past year.

JOURNAL OF CHEMINFORMATICS (2022)

Article Microbiology

Spacer prioritization in CRISPR-Cas9 immunity is enabled by the leader RNA

Chunyu Liao, Sahil Sharma, Sarah L. Svensson, Anuja Kibe, Zasha Weinberg, Omer S. Alkhnbashi, Thorsten Bischler, Rolf Backofen, Neva Caliskan, Cynthia M. Sharma, Chase L. Beisel

Summary: CRISPR-Cas systems store foreign DNA fragments as immunological recordings to combat infections. The newest spacers stored in the system are prioritized for immune defense and this process involves the interaction between the leader region and the conserved repeats bordering the newest spacer, leading to accelerated crRNA processing.

NATURE MICROBIOLOGY (2022)

Article Biochemical Research Methods

CRISPRtracrRNA: robust approach for CRISPR tracrRNA detection

Alexander Mitrofanov, Marcus Ziemann, Omer S. Alkhnbashi, Wolfgang R. Hess, Rolf Backofen

Summary: This study introduces a new pipeline, CRISPRtracrRNA, for screening and evaluating tracrRNA candidates in genomes. The pipeline combines evidence from different components of the Cas9-sgRNA complex. It also utilizes a newly developed structural model to simulate the structure of tracrRNA. Additionally, evidence is provided through the detection of repeat sequences, terminator signals, and RNA-RNA interactions.

BIOINFORMATICS (2022)

Article Multidisciplinary Sciences

The long noncoding RNA mimi scaffolds neuronal granules to maintain nervous system maturity

Dominika Grzejda, Jana Mach, Johanna Aurelia Schweizer, Barbara Hummel, Andrew Mischa Rezansoff, Florian Eggenhofer, Amol Panhale, Maria-Eleni Lalioti, Nina Cabezas Wallscheid, Rolf Backofen, Johannes Felsenberg, Valerie Hilgers

Summary: This study identifies a long noncoding RNA, mimi, as a scaffold for large neuronal granules in the adult nervous system. Neuronal ELAV-like proteins directly bind mimi and mediate granule assembly, while Staufen maintains condensate integrity. mimi granules contain mRNAs and proteins involved in synaptic processes, and their loss impairs nervous system maturity and neuropeptide-mediated signaling, leading to neurodegeneration.

SCIENCE ADVANCES (2022)

Article Instruments & Instrumentation

Jet-loaded cold atomic beam source for strontium

Minho Kwon, Aaron Holman, Quan Gan, Chun-Wei Liu, Matthew Molinelli, Ian Stevenson, Sebastian Will

Summary: We present a design and characterization of a cold atom source for strontium (Sr) using a two-dimensional magneto-optical trap (MOT) that is loaded directly from the atom jet of a dispenser. The atom flux of the source is characterized by measuring the loading rate of a three-dimensional MOT, which reaches loading rates of up to 10^8 atoms per second. This compact and low-power consumption setup addresses the challenge of reducing complexity in cold beam sources for Sr, making it important for applications in optical atomic clocks, quantum simulation, and computing devices based on ultracold Sr.

REVIEW OF SCIENTIFIC INSTRUMENTS (2023)

Article Astronomy & Astrophysics

Star Formation Laws and Efficiencies across 80 Nearby Galaxies

Jiayi Sun, Adam K. Leroy, Eve C. Ostriker, Sharon Meidt, Erik Rosolowsky, Eva Schinnerer, Christine D. Wilson, Dyas Utomo, Francesco Belfiore, Guillermo A. Blanc, Eric Emsellem, Christopher Faesi, Brent Groves, Annie Hughes, Eric W. Koch, Kathryn Kreckel, Daizhong Liu, Hsi-An Pan, Jerome Pety, Miguel Querejeta, Alessandro Razza, Toshiki Saito, Amy Sardone, Antonio Usero, Thomas G. Williams, Frank Bigiel, Alberto D. Bolatto, Melanie Chevance, Daniel A. Dale, Jindra Gensior, Simon C. O. Glover, Kathryn Grasha, Jonathan D. Henshaw, Maria J. Jimenez-Donaire, Ralf S. Klessen, J. M. Diederik Kruijssen, Eric J. Murphy, Lukas Neumann, Yu-Hsuan Teng, David A. Thilker

Summary: We measured the empirical relationships between local star formation rate and properties of the star-forming molecular gas in 80 nearby galaxies. These relationships, known as star formation laws, aim to predict the local SFR surface density using different combinations of molecular gas surface density, galactic orbital time, molecular cloud free fall time, and interstellar medium dynamical equilibrium pressure. Our results show that these relationships have intrinsic scatter and the slope of the molecular Kennicutt-Schmidt relation remains roughly constant across different environments. The other relationships show variations in their slopes, suggesting systematic changes in star formation efficiency and pressure-to-SFR surface density ratio.

ASTROPHYSICAL JOURNAL LETTERS (2023)

Review Cardiac & Cardiovascular Systems

The challenges of research data management in cardiovascular science: a DGK and DZHK position paper-executive summary

Sabine Steffens, Katrin Schroeder, Martina Krueger, Christoph Maack, Katrin Streckfuss-Boemeke, Johannes Backs, Rolf Backofen, Bettina Baessler, Yvan Devaux, Ralf Gilsbach, Jordi Heijman, Jochen Knaus, Rafael Kramann, Dominik Linz, Allyson L. Lister, Henrike Maatz, Lars Maegdefessel, Manuel Mayr, Benjamin Meder, Sara Y. Nussbeck, Eva A. Rog-Zielinska, Marcel H. Schulz, Albert Sickmann, Goekhan Yigit, Peter Kohl

Summary: Sharing and documentation of cardiovascular research data are essential but challenges like lack of time, incentives, funding, standardization, and legal understanding exist. More tools, education, and training are needed for effective data sharing and an open science culture. Long-term effort is required for FAIR RDM.

CLINICAL RESEARCH IN CARDIOLOGY (2023)

Article Biology

An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy

Anup Kumar, Gianmauro Cuccuru, Bjoern Gruening, Rolf Backofen

Summary: An open-source, docker-based, and GPU-enabled JupyterLab infrastructure is developed that runs on the public compute infrastructure of Galaxy Europe. It enables rapid prototyping and development of end-to-end AI projects, including large-scale data training and remote AI model training execution.

GIGASCIENCE (2023)

Article Biochemistry & Molecular Biology

Interrogating two extensively self-targeting Type I CRISPR-Cas systems in Xanthomonas albilineans reveals distinct anti-CRISPR proteins that block DNA degradation

Franziska Wimmer, Frank Englert, Katharina G. Wandera, Omer S. Alkhnbashi, Scott P. Collins, Rolf Backofen, Chase L. Beisel

Summary: This study investigates extensive self-targeting in the plant pathogen Xanthomonas albilineans by two CRISPR-Cas systems and identifies two Acrs proteins that inhibit the activity of Cas3, expanding the known suite of DNA degradation-inhibiting Acrs.

NUCLEIC ACIDS RESEARCH (2023)

Article Biochemistry & Molecular Biology

Heterogenous nuclear ribonucleoprotein D-like controls endothelial cell functions

Sandra Fischer, Chiara Lichtenthaeler, Anastasiya Stepanenko, Florian Heyl, Daniel Maticzka, Katrin Kemmerer, Melina Klostermann, Rolf Backofen, Kathi Zarnack, Julia E. Weigand

Summary: HNRNPDL plays an important role in endothelial cell functions, and its knockdown affects cell proliferation, migration, and sprouting ability, possibly through regulating the expression of specific genes.

BIOLOGICAL CHEMISTRY (2023)

Article Biochemistry & Molecular Biology

Improved discovery of RNA-binding protein binding sites in eCLIP data using DEWSeq

Thomas Schwarzl, Sudeep Sahadevan, Benjamin Lang, Milad Miladi, Rolf Backofen, Wolfgang Huber, Matthias W. Hentze, Gian Gaetano Tartaglia

Summary: Enhanced crosslinking and immunoprecipitation sequencing (eCLIP-seq) is a method for detecting RNA-binding protein binding sites. However, current analysis strategies have low replication and high false positive rates. DEWSeq, a R/Bioconductor package, improves the detection of binding regions by utilizing replicate information and size-matched input controls. It has been shown to significantly increase the number and quality of binding sites.

NUCLEIC ACIDS RESEARCH (2023)

Article Biology

Loop detection using Hi-C data with HiCExplorer

Joachim Wolff, Rolf Backofen, Bjoern Gruening

Summary: This study presents an algorithm for detecting chromatin loops using continuous negative binomial distributions and the HiCCUPS method, which has been integrated into the HiCExplorer software. The method achieves high detection rate and accuracy, and is the fastest CPU implementation currently available.

GIGASCIENCE (2022)

Article Optics

Laser cooling scheme for the carbon dimer (12C2)

Niccolo Bigagli, Daniel W. Savin, Sebastian Will

Summary: We present a scheme for laser cooling of C-12(2) and provide calculations for the branching ratios of cycling and repumping transitions. Our results show that C-2 cooling, using specific bands, is achievable under realistic experimental conditions. This work opens up possibilities for cooling molecules with carbon-carbon bonds and potentially enables quantum control of organic molecules.

PHYSICAL REVIEW A (2022)

暂无数据